<110> Folkesson, Anders 

<120> The complete sequence of the cs7 insert in Salmonella 
enterica serovar Typhimurium 

<130> Complete sequence of the cs7 insert 

<140> 
<141> 

<160> 5 

<170> PatentIn Ver. 2.1 

<210> 1 

<211> 46870 

<212> DNA 

<213> Salmonella typhimurium 
<220> 

<221> CDS 

<222> (37368)..(37868) 

<223> safA putative fimbrial subunit 

<220> 

<221> CDS 

<222> (37952)..(38689) 

<223> safB putative periplasmic chaperone 

<220> 

<221> CDS 

<222> (38713)..(41223) 

<223> safC putative outer membrane usher 

<220> 

<221> CDS 

<222> (41245)..(41715) 

<223> safD putative fimbrial subunit 

<400> 1 



gatacaaatc 


tcagggtgtt 


tttatacatc 


ctgtgaagta 


aaaaaaaccg 


tatcactgta 


60 


aaagggatac 


ggtttttttt 


cgtcttcaag 


aagttccacc 


gtctatcgtg gaatctggcg 


120 


caaatgggcc 


tacgcctgga 


tgacgaacag 


gatattaccg 


ccacttcttt 


cactgtcatg 


180 


gctattttga 


tcccactgac 


atttaaggcg 


cggcctcatg 


gcggtgctta 


accgggatcg 


240 


ggacatgttc 


agcgcagaag 


cagactgcgt 


aatgttgata 


tcactcagat 


aattacggag 


300 


aaccgccaga 


catgcgcatc 


atcactccag 


ggcatcccac 


ttctccagca 


actccaccgg 


360 


gatctcattg 


atcacctccg 


agaaccgttt 


tcccaccagt 


ctttcagcct 


ggcgtaacag 


420 


tgggatggtc 


gggctactgg 


gttcactgct 


ctcaaaccag 


cgacgaatgc 


accgcaggcg 


480 


ttccagcgca 


tcattacgat 


cgcgaattgg 


ccccggttca gcattgtgtg ggagggatat 


540 


accgggtggc 


gcgttgacag 


gcggcacatc 


ctctgcccgt 


gccggaacgg 


cattatccat 


600 






aaccgtgcct 


gcggggtctg 


gcactggcgc 


tggcgacgga 


gggatttccg 


gtgtggttgt 


660 


ctgtaccacg 


tcaggcagca 


gcgctaacaa 


ctgccgcagg 


cgtgaaaaat 


ctggagccag 


720 


atcgcctaat 


gtctcacgcg 


cccacacttg 


caggcgttct 


gcgctttcct 


gcgcttcgcg 


780 


aaacgccgcc 


agcggtaatg 


ccccacgggc 


ttcgaggtcg 


gccagttgct 


ggcggacgga 


840 


ttcgggggcc 


agcgcatcag 


ccgggcgggg 


agcggataac 


gcgcgctcca 


catcccgtac 


900 


ctgtaggcgc 


agggcggcac 


tgtttgacag 


cgtaataccg 


cgaatgtccg 


ccatcacgcc 


960 


ttcgtgatcc 


agcagcgccg 


ccagggcatt 


actgcgcgcc 


agcgacgcat 


cttcaataga 


1020 


ttctgcggat 


gcgccttcgc 


cggtaagcag 


ctgtggatgg 


agtgcatcag 


accagattac 


1080 


gctcagttct 


gccagttgtg 


tgagcatttc 


cgccagtccc 


tgtgcgccag 


cctgctggat 


1140 


gcgactgcgc 


agcagcagga 


tcaacacccg 


gatatcctta 


ctgcgcgtga 


gcagacggcg 


1200 


cgcatcacgt 


tcaatttccg 


gccagttcac 


ggcttccggc 


gtactgacaa 


aatcaccata 


1260 


ttgcgcctcc 


gcctgggggg 


cggcgcggct 


gaacagcagc 


aggtattccg 


gatcgtactc 


1320 


cggatcgggg 


ccgcagggtt 


gttccgcgct 


taccggttta 


gtcagggaca 


tgtccatatc 


1380 


attactctca 


gtgggttaag 


ccgtgttcag 


gttcaaaaat 


catgccggtc 


accggacggt 


1440 


catgcatggc 


tttacctgcc 


caggccgtcc 


agccaagccg 


ctgcgcgccg 


ccgatcaccg 


1500 


caggcggtgc 


ggcatgaggc 


ttcaactgca 


gctcaatttc 


ccagcaatat 


tcaaaaccga 


1560 


tgaacgtacg 


caccagttct 


gtcagcacgg 


gcagattgtt 


gccgccgggc 


agaaaacgca 


1620 


ggtaatcctc 


cagcgtgagc 


gggccgataa 


ttagccggaa 


tttgtactgc 


atatccggta 


1680 


cggcctggcc 


gataagcgcc 


ccgttgccca 


gcaccgaaga 


ctgcgcgggc 


gtgcccagac 


1740 


gggtgatttc 


atcgttcgcc 


accgttatcc 


agtgcagggc 


gaattctttc 


accgcaaaag 


1800 


gtacgctgaa 


atagtgcgcc 


agcgtggcgg 


ccagcccgtc 


aggattgcgc 


gattcgcgta 


1860 


ccagatgggc 


ggaggctgcc 


aggcgaacat 


gatctgacag 


cgggctttcg 


gcgctttccc 


1920 


gtagatcctg 


cccgctgaga 


ctggcgatat 


aaaacgcaaa 


acggtcgtgt 


tccggtttgt 


1980 


ccagcccgcc 


accagcggac 


tgggcgctgc 


gccatgcctg 


ccagaactgc 


gtcagccagc 


2040 


ggtggtgaaa 


aatattggaa 


aaatgaacca 


gcgtcggatc 


gtgacgactc 


tctgagcggt 


2100 


tcagtgccag 


ctcggtatag 


tgcagcggca 


acgggccgtt 


tggcccccat 


agtccgaggc 


2160 


tgtacaggct 


caggtgcagg 


cgtccatcct 


gccagctgac 


ctgggcgatt 


tcccgtggcg 


2220 


caaaggtcat 


cgtcggcgtc 


tgtcccagac 


ggaatttttc 


catccgtggc 


ttgccagata 


2280 


cttgcctgcc 


ggagtatcac 


agagctgggc 


atccacgcgc 


cgcatcaggt 


tcaggaatcc 


2340 


gtaacgccag 


ggggttttct 


gcgcctggtt 


gagagcgccg 


gtcatacgct 


cccccgttgc 


2400 



ccggttctga ccggccagg 




tgacatgc ccgcgttgca tcgagtgcag 







tcatctgc 2460 



y cty day y ccil 


taatggaaac atggcgggca 


acacagcgcc 


ccagcaccag 


accgaacagg 


2520 


t- a arret a ft" era 
Ldayyav, cy ci 


taccggaaaa 


tccttcttcg 


uccacyytca 


gctcgcaact 


gacgccccgg 


2580 


0 d i— o.y c* \^ v_ ex. 


acaggccgga 


gccgggcagg 


Cyycyyy uca 


ccggggtggt 


tttgcagcca 


2640 


dLCdyyCLyC 


gcacctggcg 


cgactgcggg 


cxgtcgcgag 


ccgggataaa 


cagattcagc 


2700 


agaccgcgca 


gcgcctggcc 


gccggtgcgg 


tgatccagat 


cggccagcgg 


cagataatta 


2760 


aacgacaact 


gccggatcag 


ccgccaggcc 


atttcgcgtt 


cagccagcgg 


cggctgcggc 


2820 


gggcgcggcg 


gtctgataag 


acccacgccc 


gccaccggaa 


tcgccgcatc 


tacggtcaga 


2880 


tcatcccggc 


cattacgtgg 


aataaggcag 


ggcagatcgc 


ggttagtcac 


cattgccgtg 


2940 


acggtgatat 


ggcgcagatt 


ttccgggtag 


ggcgcttcat 


gctgatcaac 


cagcgagagg 


3000 


aagacttccg 


agccggtata 


gggggttcgg 


gtgccatagc 


ggcgggcgtt 


ttctgacgag 


3060 


cggcgcggtt 


cacgacgcag 


tgaaaaataa 


cgcccgtggt 


tgccttcgtc 


attattacgg 


3120 


gtgtgataca 


gcggacgaaa 


aatcatcttc 


cgtgtggttt 


ccgcttccag 


gccttcgact 


3180 


tcctgaacag 


aaaacacctc 


gtaatccagc 


ggacgggtac 


gatccaccac 


cagatgctgt 


3240 


tccgtcacgc 


tgtgagtgac 


ttcaatccgg 


gtggtggtgc 


gaggaagcag 


gttgatcacc 


3300 


ggcgtacaga 


acaggctgaa 


ctgtgcagcg 


tccgtctgat 


gaatcagcca 


gtccggcggc 


3360 


aggcggttaa 


gcagtatgac 


aatttccgcc 


acattgccct 


gcaccttttg 


taacccggca 


3420 


gacaatccgg 


tcggggtgaa 


gaagtaaaac 


cgttccggac 


aggcgaaaaa 


ttcatgcagc 


3480 


agattatggc 


cgtgaaacac 


gttccaggcg 


agcggtagca 


gcccctgccc 


tggctccagc 


3540 


ccttcgtgcg 


ccaccgggtg 


ttgaagattc 


acattcagtt 


cgccgtcaaa 


gtgaccgggt 


3600 


tcaccggcca 


gtgtggcgac 


ggcgctggta 


tgtagcagct 


caaacaggtg 


tgacgcaatg 


3660 


cgttcttcgc 


cgcagaggta 


aaagggcagc 


cgtgccggac 


cggccagctc 


gctgaaagtc 


3720 


agttccccga 


aggttcgcag 


ggtgatgcgc 


aatgccccgg 


cgacatgaat 


attaggcggc 


3780 


agatagcggt 


gcagggcggg 


catatccggc 


ggcgcggcgg 


tcaggcgtac 


ctcctcgatg 


3840 


gacagcggcc 


acagcgtgac 


gtcctggctg 


ctgcgaaact 


ggcaggcggt 


X_ 4_ X_ l_ *m*mm v-^ >-« 4— 

atccccgcct 




cccgggaugg 


gggaaacgaa 


cgcggtatcg 


cgcggcacgg 


tgaccccctt 


cgccaggncg 


i q c r\ 


ccttcctgcg 


cancgggaca 


cagccccacc 


accgccaccg 


atggcgtggg 


ggtgacgcaa 




ttggggctga 


cgacttccag 


taaccgctgt 


gtgaagcggg 


gaaactcggc 


gtcaattttt 


4080 


agctgagtgc 


gggcgctcag 


aaagctgaac 


gcctcgatca 


tgcgttccac 


atacgggtcg 


4140 


gcaatatcgg 


ttccctgcat 


ccccagtcgg 


gcggcaattt 


tgggatggag 


ggtggcgaac 


4200 


tcagcaccgg 


tctcccgcag 


gtagctcagt 


tcgcggttgt 


aatactccag 


tagccgtgga 


4260 



V 4 



tccatgaata 


atgccctgta 


ttaaaagaac 


gtaatgcggc 


tcagctccat 


atccagcgcg 


4320 


ctgcgtacca 


gaaactccgt 


gggatacggc 


tgcgtcagaa 


tttgtccgcg 


aatttcaaac 


4380 


tgtagcgtgt 


tatagctacc 


ctgccgggtt 


ttatcgagca 


agggcgtgac 


cctgagtgtg 


4440 


gcggcgttca 


gdcggggttc 


gaaacggata 


atggcgcgcc 


ggatcgcctc 


gctgatatcg 


4500 


tcccacttat 


gctcatacat 


aaagctgccc 


gccagcggcg 


gcaggccata 


gttgagcact 


4560 


gacgccgccg 


cctgcggata 


gcgccgggcg 


tcgatgtcac 


cctcgtggct 


aatggtattg 


4620 


agcaaaaagg 


agagatcccg 


gcgaatgatc 


tccttcagtt 


gtaccggcgt 


gacgctgata 


4680 


tcccggtcaa 


ttttctgata 


cggagcattg 


tcacacagcc 


gatcaaacag 


cgtgggcagc 


4740 


aggtggttag 


cgggtgtaaa 


acgagacgtg 


ctcatgcgcc 


atcgttttcc 


tgagcatgaa 


4800 


aggtacaatg 


ggccatgtcc 


agcaggctga 


tatcgccgtg 


gctggtcagc 


cacactttct 


4860 


gccccagcgc 


ccgcacggtg 


gtttcgccgg 


ggccgtcctg 


ccaggcggtt 


tccctgcaca 


4920 


gacgcagggc 


gtcggatgca 


ctttccgaac 


cgctgtaacg 


ggtaaagagc 


caggcgccgt 


4980 


gcgtatcgcc 


attcaccagg 


gtgatattaa 


cgggtttcca 


cagcagatca 


gtcaggcgcg 


5040 


tcggttgcgg 


cgattccagc 


gagcgtattt 


gcgaaaacgg 


cagccagata 


tacacgccgc 


5100 


cggtgactag 


ctcaagtacc 


gggccaaggc 


gggaatcgct 


gtcgctcgcc 


cagtcaaatg 


5160 


cgccgccgtt 


ccactgcccg 


cccgtgtctg 


ttatggcttc 


cagtgcggta 


ttacggtgtt 


5220 


tatcaacctc 


accggtatcg 


tcatgacagg 


cgagtgccgc 


cagcagtgac 


tccacccaaa 


5280 


cgggctgcgg 


cagaagaaaa 


ccgggtcgtt 


gttcaccctg 


aaaaacggtg 


tggcggaaca 


5340 


tttcgcagcg 


aaccagctcc 


cggtacagcc 


gggcctcctg 


ggtataattg 


gcctccatcc 


5400 


tggcgcatag 


ctgaagctgg 


tgtagcgccc 


gcgaccagtc 


tccggccaca 


cacagcaact 


5460 


gaaacaggct 


gtggcggcag 


agcgctttcg 


ccggattttc 


ccgaacctgc 


tgctccgcca 


5520 


tctgaatccc 


ctccgcaata 


gagtattcct 


gtatcagcgc 


ggacagggta 


gcaggaagcg 


5580 


tgtcagtttt 


tttcatgggc 


ggtatttcca 


tttttctgtg 


tcggagtgat 


tcggtagtgg 


5640 


ctatcgatgc 


caataatacg 


atgctcgcgg 


cgggtcagat 


cgggcagtat 


ctcgctatgt 


5700 


gcggttttcc 


cccccagttc 


cggcgagagc 


agttgcagga 


tatcgggcat 


tgattccatc 


5760 


gccagccagt 


gcatttcacc 


ctcaccggtc 


gtgtccagcg 


tatccagaat 


ggcatcaata 


5820 




ctaccaccat 


atcctataac 




cacccttttt 


atcatataac 


5880 


gacgtcaaat 


cctgggtgac 


agcgtcttca 


tttcggggaa 


acggtttagc 


acggaggggc 


5940 


ttttcctgcc 


cggagggccg 


caatgcctgc 


tgatattcct 


ggtaaagctg 


gtgtagtgtg 


6000 


ctatccgctt 


catggccagt 


gtaagaggat 


gcggtttccc 


ggacgggaat 


aatatcgaac 


6060 



gggttctgcg 



gattgagct 




ttgttta 



aaccagtcca gatccagg 







agcaacg 



6120 



gattcagatg actgaaccaa ctgaggaaac gaaacatcgg gacgggactc atcattggtg 6180 
cgggcaaggc accaggatga cagaccccat tcaatcgtat ctccatcatt cagccgcata 6240 
cggtgatgag gttccatcac ctgctcattg acggcacagc acagttcatc agagtgatta 6300 
acaatccacc acgcctcttc gtggcgaaca agactgagcg cgatatcgtc tttcccatgt 6360 
tgctcaaaaa gcggaatgta gggggcggcg gcagtaaata cgatgtatgt ccctgcggga 6420 
tactgtgcgt tatccgttat accctgcgac tgtattttcc gtaattccca catgtcttgt 6480 
ttcattatgt cccttaaatg tatttatttt tctggaggaa acgtttaggg agttttaatt 6540 
cataaataat tttaactaaa tttataggga gtcattattg atgacaccct ttttattatt 6600 
ttcgtcgtga atgcattgtt gtatgcatag atgtcttttt tgaaatatta tttcttttaa 6660 
ttctgcaatt gcgattttta tcttgctgtt cattgattat ttaatattaa cggcttagtt 6720 
gctaattagt ccctgatctg tatcattgtt ttgtttcgat attttttcga ggctatcaat 6780 
aaagaattgt ttatatattt atatgcattg atgcattatt tttatgaatt tttatgtcac 6840 
aaggcataac acatggaaac tcctgtttca cgcagtgcgt tgtatggaaa actggccggc 6900 
ccactattcc ggtcgctgga atcggcaacg gcattttgca aactacgctc taatccctgg 6960 
gttgagctga ctcactggct gcaccagtta acacagcagc ccgataacga tattctccac 7020 
gttcttcggc attaccagat ccctctttct gatgtggaga aagcgttact ccggcaactg 7080 
gatatgctgc ccgccggggc cagcgccatt agtgattttt ctcaccatat cgatctcagc 7140 
gttgaaaagg cctggatgct ggcgagcgtc cgttacggcg ataacaaaat tcgcagcggc 7200 
tggttgctgc tggccttgtt gaccacgcca gaactgcgtc gggtactgag cagtatctgc 7260 
gcgccgctgg ccacgcttcc ggttgatgaa ctgacggaaa tactgccctc gttgatcgaa 7320 
acatcgccgg aagcgcagga gcgcccttac gacggctccg gcctggcatc agccattccc 7380 
ggtgaaagca gtcaggcgat tcccaacggc gggcaggacg gtaaatccgc gctggcaaaa 7440 
tactgtcagg acatgacggc acaggcgcgc gacggcaaaa tcgacccggt gacggggcgt 7500 
gagcatgaaa tccgcaccat gacggatatt ctgctgcgcc gtcgccagaa taatccacta 7560 
ctgactggtg aggcgggcgt cgggaaaacg gcggtcgtcg aaggttttgc cctcgcgatt 7620 
gcgcaggggg aagtgccgcc cgcgctgcgg gaagtacggc tactggcgct ggacgttggc 7680 
gctctgttgg ccggagccag catgaaaggc gagtttgaat cgcgtctgaa agggttactg 7740 
gaagaggccg ggcgctcgcc gcagccggtt attctgtttg ttgatgaagt tcacactctg 7800 
gtgggcgcgg gcggcgcatc cggcacgggc gatgccgcta acctgctgaa accggcgctg 7860 
gcgcgcggca ccctgcggac tatcggcgcc accacctgga gcgaatacaa gcgccatatt 7920 



gagaaagatc 


cggcgctgac 


ccgtcgtttt 


caggtgttgc 


agattgccga 


accggaagag 


7980 


atccccgcaa 


tggaaatggt 


gcgtggtctg 


gtggatacgc 


tggaaaaaca 


ccataacgta 


8040 


ctgattctgg 


atgaggcggt 


acgtgcggcg 


gtacagcttt 


ctcaccgcta 


cattcccgcc 


8100 


cggcagttgc 


cggataaggc 


catcagcctg 


ctggataccg 


ccgcggcccg 


cgtggcgctg 


8160 


acgctgcaca 


cgccgcctgc 


cagcgtacag 


ttcctgcgcc 


agcagctaaa 


agcggcggaa 


8220 


atggaacggt 


cgctgttgca 


gaagcaggaa 


aaaatgggga 


ttcagtcaga 


tgagcggcgc 


8280 


gatgcgctga 


tggcgcgaat 


tttctcgctc 


aacaatgaac 


tgactgcatc 


cgaatcccac 


8340 


tggcagcggg 


agctggaact 


ggtacatacg 


ttgcaggaac 


tacgtctcgc 


aaaatctaat 


8400 


gctgatgaca 


aaaccacgct 


gcaacaggcc 


gaaacggcgc 


taagaaaata 


Qcaoaacaac 



8460 


gcgccggtgg 


tgttcccgga 


agtcagcgcg 


gcggttgtcg 


cgacaattgt 


caccaactacf 



8520 


accggtattc 


ctgctgggcg 


catggtgaaa 


gatgaggcca 


gccaagtact 


oaaac tercet 


8580 


gcccgactgg 


cgcaacgcgt 


taccgggcaa 


qaccrcrcaccrc 


tacrcacacia t 


tcratcraacat 


8640 


attcagaccg 


ccagggcggg 


actgggcgat 


ccacacaaac 




otttatocta 


8700 


accgggccat 


ccggtgtcgg 


taaaaccgaa 


accococtaa 


car taciccrcra 


aactatctac 


8760 


aocQOccraQC 

as^jj^a "3 w 


agaacctggt 


aaccatcaat 


ataaacaaat 


W ~H %2 01 Z3 13 


tcacaccatt 

^— V^p d « W V-' V* <w ^ 


8820 


tccacactaa 



aaoacqcqcc 


gcccgactat 


crtcrcrac t a. ta 


a(™*cr/5 oner t - era 


t"ahar has ca 


8880 



gaagcggtgc 


gtcgccaccc 


ctggagcgta 


qtcrctqctca 


a ccf a era tccra 


aaaacrcacac 


8940 


catgacgtcc 


acgaactctt 


ctatcaggtg 


tttgacaagg 


gtaggatgga 


aaacgacaag 


9000 


ggtacacatg 


tcgatttcaa 


aaacaccacg 


ctattactca 


ccaccaacat 


ggattccgac 


9060 


ctcatcagcc 


agatgtgtga 


agatccggcc 


ttaatgcccg 


atgctacggg 


gcttaaagag 


9120 


gcgctaatgc 


cggaattgcg 


caagcatttc 


ccggcggcat 


ttctgggccg 


cgtgacggtg 


9180 


atcccttacc 


tgccgctgga 


cgaaacgtcg 


cgtggcgtga 


ttgcccgtct 


gcaccttgac 


9240 


cggctggtgg 


cgcggatgag 


tgaacagcac 


ggcgtgacgc 


tgacgtatag 


cgaggaactg 


9300 


gtcgcacata 


ttgtggcgtg 


ctgtccaatg 



catgaaacgg 


gcgcgcggtt 


gctgattggc 


9360 


tacatcgaac 


agcacattct 


gccacgactg 


tcgcgctact 


ggttgcaggc 



catgacggaa 


9420 


aaagccgcga 


tcaggcagat 


tgatatcggc 


gttaatggtg 


atgagcagat 


tgtttttgag 


9480 


atcgtttgct 


gaaaccggcc 


gttcgaagtg 


tccgtagtgc 


gattttaaaa 


actgtaccgg 


9540 


tataccgctc 


cccttgcggc 


aaccagttga 


ctaaaaagga 


aatgaaggat 


tatggctatc 


9600 


aacaatagcg 


cgcagaaatt 


catcgcgcgc 


aaccgcgcgc 


cgcgcgtgca 


gattgaatat 


9660 


gacgtagaga 


tttacggttc 


cgagaaaaaa 


atcgagctgc 


cgttcgtgat 


ggcggtgctg 


9720 



gccgatctgg 



ccgggaaacc 




tgaagaa 



ctgccgccgg 






tgacggat 




attcctc 9780 



gatattgata 


ttgataactt 


caatgagcgc 


atgaaagcca 


ttgcgccgcg 


cgtggcgttc 


9840 


gctgtgccga 


atacgctgac 


gggtgaaggt 


cagttgatgg 


tcgatatcac 


gctggaaaat 


9900 


atggacgact 


tttcgccggc 


acagattgcc 


cgcaaggtgg 


acgccctgaa 


ccagttactg 


9960 


gaagcccgca 


ctcaactggc 


gaacctccag 


acctacatgg 


atggcaaggc 


gggggcggaa 


10020 


aatctggtca 


ataaactgtt 


gcaggacccg 


actctgctga 


aaacgctggc 


gaatgcgccg 


10080 


aaatcggccg 


ctacccagca 


agatgtgtca 


gcggataatg 


aatcagcgga 


ataacgtcga 


10140 


atttttaagg 


aattttcatg 


gcaaacagta 


atatgcaggc 


aaccgacgcg 


gttgctcagg 


10200 


ataccgcctc 


cgcatccggt 


gaatttgatg 


cgttgctgaa 


tcaggccttc 


cgacccaaga 


10260 


ctacccaggc 


ggcaaaagcc 


gtggaagccg 


cggtgcagac 


gctggcgaac 


acgatcaccg 


10320 


tcagcgatga 


cgcctataaa 


agcatcagcg 


ctattattgc 


gcagatcgac 


tttaaactga 


10380 


ccgaacagat 


caaactgatc 


ctgcaacatc 


ccgactggca 


gaagctggaa 


tcctcgtggc 


10440 


gcggtatgga 


gcatctggtt 


tacaacaccg 


agaccgacga 


aaagctgaaa 


attcgcttca 


10500 


tgaatctgtc 


aaaagatgaa 


ttgcggcgca 


acatgaagcg 


ttacaagggc 


atcgcctggg 


10560 


atcaaagccc 


gatgttcaag 


aaactgtatg 


aagccgaata 


cggccagtta 


ggtggcgaac 


10620 


cttatggctg 


tatcattgcg 


gattactact 


tcgaccatac 


accgcccgat 


gtggatctgc 


10680 


ttggctctat 


cgccaaagtc 


gccgcgtcgg 


cccatgcgcc 


gtttattgcc 


ggggcttccc 


10740 


cctcggtact 


gcaaatggac 


tcctggcagg 


aactggcgaa 


tccccgcgac 


ctgaccaaaa 


10800 


tcgtcaccca 


gaacctggaa 


tatgcgccgt 


ggaactcgct 


gcgggctagc 


gaagactccc 


10860 


gttatattgg 


cctgacgatg 


ccgcgttttc 


ttgcccgcct 


gccgtatggc 


gcaaaaacca 


10920 


acccggtgga 


cgagtttgat 


tttgaagaag 


atgcggatgg 


ttctgaccat 


accaaatacg 


10980 


tctggagcaa 


cgcggcctac 


gcgatgggcg 


taaacatcaa 


ccgttccttc 


aaacactacg 


11040 


gctggtgtac 


gttgattcgc 


ggtgtggaat 


caggcggtgc 


ggtggaaaat 


cttccctgcc 


11100 


ataccttccc 


gactgacgat 


ggcggcgtgg 


acatgaaatg 


cccgaccgaa 


atcgccatct 


11160 


ctgaccgccg 


cgaggctgaa 


ctggcgaaaa 


acggttttat 


cccgttgatc 


caccgtaaaa 


11220 


actcagacta 


tgccgccttt 


atcggcgcac 


agtcgctgca 


aaaaccacag 


gaatactacg 


11280 


atccggacgc 


gacggccaac 


gctaacctgt 


ctgcccgtct 


accgtacctg 


ttcgcctgct 


11340 


cgcgcttcgc 


tcacttcctc 


aaatgtatcg 


tccgcgacaa 


aatcggttcc 


tttaaagagc 


11400 


gtgaggatat 


gcagcgctgg 


ctaaatgaat 


ggattatgaa 


ttatgtcgac 


gccgatccgg 


11460 


tgaactcctc 


gcaagaaact 


aaagcccgtc 


gtccgctggc 


tgccgctgaa 


gtagtggtgg 


11520 


aagaggtcga 


aggcaatcca 


ggttattacg 


acgcgaaatt 


cttcctgcgt 


ccgcatttcc 


11580 



agcttgaagg 


gctgacggga 


tcgctgcgcc 


tggtgacaaa 


actgccgtca 


gtgaagcagg 


11640 


gcaatgcctg 


atatatattrt 


tgtgaatgtt 


taagcgagtg 


aagtcagaga 


agatagagaa 


11700 


tataaagagg gatatgaaga 


aaagaatttc 


gtctcgccca 


cggtctcgta 


aaggtggggt 


11760 


auu l> d cl i~ a l. 




cgaatgccag 


taacaatgcc 


gaagcttttt 


atatcattga 


11820 


frHr a *T<~T S 2 3 t* 3 
ccataagccc 


aacttttcat 


ctgttacctg 


gtattgttct 


11880 






tagcctggga 


agtcagttgc 


ccggctgtta 


ttgatactca 


11940 


gcc si uccgcc 


gtgagcccga 


agtctgatgt 


cccagcggcg 


tggcagcttt 


ctccccgata 


12000 


tatgtcgcgt 


ttatggtcaa 


gtagtattgg 


ggtaacgcag 


ggtaaacctg 


aaaacctgat 


12060 


ggatctcaaa 


ccagagacta 


aaaaagtaaa 


cggtgaaaat 


tggtctgtat 


gggaaacaga 


12120 


acgtggtagc 


gataaagaaa 


ccgatcgcta 


ttgggtttcg 


tgtatttatg 


gtcatgaaca 


12180 


gatatggttg 


acgcaaccaa 


tacctgcttc 


ttctactcgc 


tgtaagactc 


gtaattttga 


12240 


gggatcgcca 


gaagaccagt 


ctgtatcttt 


tatctgtaat 


tagcgatttg 


agacgtgaaa 


12300 


atttcagtac 


aggttatggt 


ttttattatc 


ggaagttatg 


aagcattatt 


tatatgcatt 


12360 


aaataatgca 


aattcataaa 


ataactaaat 


acattatcgg 


taccggaaaa 


atatacagtc 


12420 


ctctgttctc 


ctgaagttat 


tggagaagga 


ttctgtacgg 


caatgattta 


tctataaaca 


12480 


aaaagatata 


gataaaatca 


ggtttatttt 


aagtaaaact 


taataaggat 


ataaaaatgg 


12540 


cttatgacat 


ttttttgaaa 


attgacggca 


ttgatggcga 


gtcaatggat 


gacaaacaca 


12600 


aaaatgaaat 


tgaagtactg 


agctggcgct 


ggaatattca 


tcaggaatcc 


accatgcacg 


12660 


ccggtagcgg 


ccccggcccc 


ggtaaggtct 


ccgtcaccaa 


cctggatttt 


gatcactata 


12720 


ccgaccgcgc 


cagcccgaac 


ctgttcaaat 


actgcgcctc 


cggcaagcac 


attccgcagg 


12780 


ccatcccggc 


Lduyt-y Lady 


gctggcggca 


atccgctgga 


gtacctcaag 


tataccttca 


12840 


ccgacctgat 


tgtcgccgtg 


gtttccccga 


gcggcagcca 


cgatggtgaa 


atcgcctccc 


12900 


gtgaaacggt 


ggagctctcc 


ttcagcaccg 


tgaagcagga 


atacgtggtg 


cagaaccagc 


12960 


agggcggcag 


cggcggcacc 


atcaccgcag 


gctacgactt 


caaggccaac 


aaagaaattt 


13020 


aacggctgtt 


tttccggcca 


gatgttatgt 


ctggctggtt 


ttattgtttt 


gattttaaag 


13080 


gaatttacag 


tgaataaatg 


gcgtaacccc 


actgggtggt 


tatgtgcggt 


agctatgcct 


13140 


tttgcactgc 


tcctgctttc 


cggatgcggc 


agtagcgatt 


cgctacttga 


cccctaatcg 


13200 


cagcggcctg gcctgagcgt 


gaaagcgttt 


tacaaggtga 


attctgacaa 


tcagaagaaa 


13260 


gcggcgtcca 


tgaagatacg 


tgttgagaat 


taatgaccta 


cacagaattt 


ttagaggtta 


13320 


agcaaaatga 


acagaccttc 


attcaatgaa 


gcgtggttag 


cttttaggaa 


ggtgaatcat 


13380 








tccgtcgctg atgtgggtag^Ptattggt ggaaacgttg ggaaaaataFwStggtggt 13440 
tattttcaaa atgcctgccc tattcgaatg agctatgttt tgaatgcgac agggttccca 13500 



cgcgtgaatg 


atatgattga 


ttatcttact 


catactatgg 


gcaagcctga 


tcttattgtt 


13620 


aataatccga 


aacagagtga 


ctttatcggg 


aagaaaggaa 


ttatcgtagt 


aaaagggcat 


13680 


ggctggagca 


atgccagagg 


acacgttaca 


ttatggaatg 


gcagtatctg 


ttcagatcag 


13740 


tgccacttat 


taaatgaccc 


agataatgga 


ccatttgttc 


ctgaagttgg 


gacactgtgg 


13800 


atactgccgt 


gaaatggtta 


atattagtta 


ctttcagcat 


aagtggaatg 


ctggtatggc 


13860 


agccatcctt 


tgcacaagaa 


gcattgacca 


cacaatattc 


acagtcggaa 


ctcttaaaaa 


13920 


attgggcgct 


gagtcattgt 


ctggcattag 


tatacaaaga 


tgatgtcgtt 


aaaaacgatg 


13980 


ccagagctac 


ggccagtgct 


taccttgaat 


atggtaaaca 


atctgtggag 


atttaccatg 


14040 


aaattgatga 


gattgcgaaa 


aaatattcag 


ggttgaaata 


taacggttcg 


atatcatcag 


14100 


attttaatac 


catgaagtgc 


atagatttta 


tccatgacag 


ggaattaaat 


gaattaatta 


14160 


aaaggcgtgt 


cgagaagtaa 


aattcaaaga 


tattaaaagc 


atacgttctc 


ttgctctgat 


14220 


gaatttatgg 


gtaagaaaga 


gctgtacagg 


aatagttaat 


ctgttcacct 


aataaagcag 


14280 


ataaatcagg 


gcttaattta 


ggtagttaaa 


aggatagtag 


atatgtctta 


tgacattttt 


14340 


ctgaaaattg 


acggcattga 


cggcgagtca 


atggatgaca 


aacacaaaaa 


tgaaattgaa 


14400 


gtactgagct 


ggcgctggaa 


tattcatcag 


gaatccacca 


tgcacgccgg 


tagcggtctc 


14460 


ggttccggta 


aggtctccgt 


cactaatctt 


tcatttgaac 


attacatcga 


tcgcgccagc 


14520 


ccgaacctgt 


tcaaatactg 


ctcttccggt 


aagcacattc 


cgcaggccat 


tctggttatg 


14580 


cgtaaggctg 


gcggcaatcc 


gctggagtac 


ctcaagtaca 


ctttcacaga 


tctgattatt 


14640 


gcaatggtat 


cgcccagcgg 


aagccaggga 


ggggaaattg 


cgtctcgcga 


atcaattgaa 


14700 


ctctccttca 


gcaccgtgaa 


gcaggaatac 


gtggtgcaga 


accagcaggg 


tggcagcggc 


14760 


ggcaccatca 


ccgcaggcta 


cgacttcaag 


gccaacaaag 


aaatttaacg 


gctgtttttc 


14820 


cggccagatt 


tatatctggc 


cggatttatt 


attttgattt 


taaaggaatt 


tacagtgaat 


14880 


gaatggcgta 


accccactcg 


gtggttatgt 


gcggtagcta 


tgccttttgc 


actgctcctg 


14940 


ctttccggat 


gcggcagtag 


cgatgcgtta 


cctgacctcg 


aatcacagcg 


actcgacctg 


15000 


agcgtgaaag 


cctccgataa 


ggtgaatcct 


gacaatcaga 


agaaggccgc 


gcccattgag 


15060 


atacgtgttt 


atgaactgaa 


aaatgacgcc 


gctttcacga 


cagctgatta 


ctggtcgctc 


15120 


catgacaacg 


acaaatccgt 


ccttaccgac 


gatttagtgc 


gtcgcgacag 


ctttattttg 


15180 


cgtcccggcg 


aagagaaaaa 


actgcgtcgc 


ccgctgaatg 


cgcagaccac 


ggcaatcggc 


15240 



atagcccgta actctccgta 



tgcaaaggtt agtggtgccg 



ataataaatt 



ctatatttat 13560 



gtactggccg 


gataccgtaa 


cctggccaaa 


tcggtctggc 


gggtaaccta 


caaaatcccg 


15300 


gaagccccgg 


aaaaagcctg 


gtacagcagc 


ttcatatcgg 


ggaaaggaaa 


agtgcagttg 


15360 


gaggcggaac 


tggaacaaag 


cgccattgta 


attacggaac 


gggataaatg 


aattatgagc 


15420 


tggaatgacc 


gcgtagtctg 


gagtgaagga 


caatttttac 


tgccgcagat 


gtttcagcag 


15480 


caagagcgtt 


atctggaaca 


cgtcatgcat 


taccgcagcc 


tgccgctgac 


cccctttttc 


15540 


tggggattca 


gccactacaa 


tattgatggc 


gaagcgctga 


acatcggtaa 


actgatactg 


15600 


aaagaggcat 


cagggatttt 


tcctgacggc 


acgccgttta 


acgcaccgga 


ccacaccccg 


15660 


ctgccgccgc 


cactgaccat 


tctgccggag 


cacctgaacc 


agcagatttg 


tctggcggta 


15720 


ccggtacgcg 


cgccgaacag 


cgaagaaacc 


acgtttgaca 


ataacccgga 


atcattggcg 


15780 


cgtttctcgg 


tacatgaaca 


cgacatccgc 


gacgccaact 


cgctgggacg 


tggcgcgcag 


15840 


ttattacagc 


tcagtcatct 


gcgcctgcgg 


ctgctgccgg 


aaaaggcggt 


gacgggcgcc 


15900 


tggattggcc 


tgccgttgac 


ccgcatcacc 


gggttgaacc 


ctgacgggcg 


gatagatatc 


15960 


gaccacgacc 


tgatcccgcc 


catcattaat 


tatcaggcca 


gttcactgat 


gtgtacctgg 


16020 


ctgtcgtgga 


tcaacgatct 


catccggatg 


cgggccgatt 


cgctggcgga 


acggctgacc 


16080 


ggcagcgaca 


accacggcca 


tgaagcagcg 


9^99tctccg 


attacctgct 


gctgcaaatt 


16140 


ctcaatcgct 


ttgagccgct 


gctgactcac 


ctggcgaaaa 


ccccgctggc 


cccggaggtg 


16200 


ctgtaccgct 


acctgtccga 


actggccggg 


gaactctcca 


cctatgtgcg 


tccacaaacg 


16260 


cgacggcccg 


ctgaatacaa 


agagtacaaa 


cacctgacgc 


cctatgccgg 


gttgaaatcg 


16320 


ctggttgatg 


aggtgcagtt 


cctgctgaac 


gcggtactga 


tccggggcgc 


gcagcgcatc 


16380 


gagctgaaag 


aggggactta 


cggcatcctg 


aatgcggtgg 


tggccccttc 


cgatcttgcc 


16440 


gatttcagca 


cgctggtact 


ggcgataaag 


gcttcaatgc 


cgaccgatgt 


gctactgcaa 


16500 


cattttgccg 


cccagaccaa 


aatcgggcca 


tccgatcgcc 


tgccggaact 


gatccgctcg 


16560 


catctgccgg 


ggctggcttt 


gcaggttctg 


cctgtaccac 


cgcgccaaat 


cccgtttcag 


16620 


gccggataca 


tctattacga 


catccgccgc 


gagggagcat 


tgtgggaaca 


cattgcccgt 


16680 


tacggcggga 


tggccatgca 


taccgccggg 


gaatttccgg 


ggctggagac 


agaactgtgg 


16740 


ggagtgcgcg 


ataaatgaca 


gacagtaccc 


tgacgccgcc 


agcggcggat 


atgatgtcct 


16800 


ttttgtccac 


cacgccggaa 


cataaggaca 


gtgaatatga 


aacgccggta 


cacaccagcc 


16860 


agcgcacgga 


actcaatgtc 


atcgttgaag 


acggtccgga 


cagcaaactc 


cggctggctg 


16920 


aaatcagcgc 


ggcggctaac 


ccgttgctcg 


ccgctgcccg 


gcctttattg 


tgcgctctcg 


16980 


cagccatgcc 


cgctaaactg 


gatgcggccc 


tggtagagcc 


ttaccgtaat 


ctgctggtac 


17040 



gcgagatgca 



tctgtaccag 




ttatgcg atcaggcgaa cctgcggcgc 
cacgtac 17100 



tggcggtacg ttactgcctg tgtacggcgc ttgatgaagc cgccaataac acaacctggg 17160 
gacggcgcgg cgtctgggcc ggaaaaagcc tgctggtaac atttcatggt gaaagcgaag 17220 
gcgggataaa acttttccag atcatcgggc gtctggcggc cagcttccag gagcatggca 17280 
acgtactgga ggttatctac cacctgctgg ggttgggatt tgaaggccgc tacagcgtgc 17340 
agccagacgg gcgtaagcaa ctggacaata ttcgccagca actgctgaca cagctttcac 17400 
agcgtcgcga tccggttatg cccgcgctct cgcctgactt tcagggggcg ataagcggac 17460 
gactgcggcg gatgcgccgg gtgccggtct ggctgagcgc cgggatagcc ctgttggcga 17520 
tgctgacgct gtttggcctt tacagccacc ggatggatgt gcagaccgtc accgtacaac 17580 
agcatattga tgcgattggt ataaaactgc cgccgccgcc tgtgccggtt cataagctgc 17640 
ggctgaaaat cctgctggca aacgaaatcg cccgtggcct gctgaccgtg gacgaagatg 17700 
accagcacag tagggtggtc ttccgtggcg acgccatgtt tgtgccggga cagaaaacgg 17760 
tgagtgacgc aatccggcca gtgattaaca aagcggcgcg ggaaatcgcc cgcgtgggcg 17820 
gcgcagtcac tgtaacgggt cacactgaca gccagcccat tcattcggct gaattcccat 17880 
ccaacctggt actgtcggaa aaacgggcgg cggaagttgc ggcgttgctg acctccggcg 17940 
gcgtacctgc cggacgggta catatcgtcg gcaagggcga tacggtgccg gtggcggata 18000 
acggcagtaa agccgggcgg gcgaaaaacc gtcgggtgga aattctggta gtggagtgaa 18060 
tgaatgatga aaaaatcaac ctatgatgtg tctcatcatt cggcagtatg tggcgtgacg 18120 
ggggattatt atcggatctc agcgacatat cacataacac gatctgttcg tgtttttttg 18180 
atcatcttat gttgcctgtt atccggtggc gcttttgccg gatccccgat taacgcagga 18240 
ttcatttccc ccgataatgt caacctcagt actcaggatt tcctgaaatt ttatgccact 18300 
gacaacgtac agaaaaaaga caatgcactg atgtatatgc tgggggtgga ggatgcgaca 18360 
gaaggtaaag cctggtgtgg atatggtcag gttgacagta taacaataaa ccatactgtg 18420 
ctgacctggt ttgaacagca cgcagtgaaa aagcctgatg taagggcttc aatactaata 18480 
gaggaagcat tagttaaaaa ttttccctgt cagaggacag actcctccat aaaaattgct 18540 
tcccggtcat ctcccatttt atccctgacg ccggatgcgc ttaatctttc aggtaatgac 18600 
ttttttaaat tttgggtgtc tggtaatcaa cgggataaac tcagggcggg tgtctatctg 18660 
ctcggcgtgg aggatgcgac agagaacaaa ctgtggtgtg gatacgcttt atttaagacg 18720 
ctaacattaa atgaattagt ctatgtttct cttaaaaata aaaccaatga ggaactgaat 18780 
tctcgcgcgg ctgaacttat cataaataaa ttaatagagt atccctgtaa tatataaaat 18840 
cattcaagtt gcatcaaggc ggcaagggag tgaatccccg ggagcgtaca ttagttcgtg 18900 



actggggtga 


gcgaggaaag 


ccaacgcaca 


tgcagcctga 


agtatgacag 


gtataccctg 


18960 


tgataatgtt 


atcgctgcca gttcagataa 


agtcttggtg 


gtaaagtctt 


ggtggtaaga 


19020 


tattgatatg 


tttaaggtaa 


tggtctgtct 


gttggtgggt 


gttcctgcca 


tttcttatgc 


19080 


gcacgattat 


gggtgtgcta 


cggttggagc 


atcaatggag 


tcatcattat 


ttgatgccat 


19140 


aaaaaatgac 


ctgaatatag 


atgtcgctac 


tattataaag 


gataaaacaa 


aggtagagat 


19200 


tcttgatatc 


tcacctgtat 


ctaaggtcta 


tgcagaatct 


ctggccagga 


tggattatga 


19260 


aaaagataag 


gccaaaaata 


aagtggcaat 


attagataaa 


aaatcctatt 


tcgatagtta 


19320 


ctatgaaaat 


caggttaaat 


ctatcgtggc 


aaaatatacc 


tatattaaca 


aagataaaga 


19380 


aaaagatatt 


ttcattgcat 


ccagcttcat 


gaatgctgat 


gagtgttctg 


taagatttaa 


19440 


tggttatatt 


actttatcca 


gggaatttta 


aaatattgat 


tattgggtgg 


cgagtcgaaa 


19500 


tataatgcaa 


aaatttctta 


gtctgctttt 


ttcccggcgc 


gcgctggcag 


ttgtgggcgt 


19560 


tctggttctg 


gcgctgctgg 


tctggtttgt 


cgggccgctg 


gtgtcatttg 


ataccctgcg 


19620 


cccgctggcc 


tccgtgggta 


gccgggtagt 


gaccattgcc 


ctgttgctga 


tgctgctggt 


19680 


actgtggctg 


gtcaactggt 


cgatgagtat 


catcggcatc 


agtgtcctgt 


gcctggcgat 


19740 


tggcttcgtc 


acaccgctgc 


tggccctggg 


cgatgtccat 


ccgtttgcgc 


cgctgtgggt 


19800 


ccgcctgacc 


ctgattggtt 


tcatcctgct 


gatgtacgcg 


ctgtacggcc 


tgtaccggct 


19860 


gtggcgtgcg 


ctgcgtatgg 


atgaacaact 


gctgcgtcgc 


ttcctgcatc 


cgcgcgggga 


19920 


agaggtaccg 


gtggcaggcg 


agatcaaagc 


cgacctgcgc 


accgtcaacc 


atattgtcac 


19980 


gcaggccatc 


cggcagctgc 


ggcagttgcg 


ggtggatatg 


cctggctggc 


gtaaaatctt 


20040 


cgagggaaaa 


cgctttctgt 


atgagctgcc 


gtggttcatg 


gtggtcggca 


gtcccggcga 


20100 


cggcaaaacc 


acggccctgc 


tgaacaccgg 


attgcagttc 


ccgctggcgg 


agcaaatgga 


20160 


gcagacttcg 


cgcatcctga 


cagtaccggg 


tggcggcacg 


ctacactgcg 


actggtggtt 


20220 


taccaacgaa 


gcggtgttga 


ttgataccgc 


cggacgctac 


gcccgccacg 


atgacggtgg 


20280 


tgaagcgagc 


gccgcgcagc 


gtaacgccgg 


agagtggcag 


ggctttctcg 


gtctgctgcg 


20340 


taaacatcgc 


cccggcgcgc 


cgcttaacgg 


cgtgatcctg 


acgctaaacg 


tggcggattt 


20400 


aaccgcacag 


tcaccggcgg 


aacgcctggc 


ggcctgcgcc 


gctctgcggg 


cgcgactggc 


20460 


agaactgcgc 


gagaccctgg 


ggattcgctt 


tccggtctat 


ctggtggtca 


ccaaaatgga 


20520 


tttgttgccg 


gggttcagcg 


aatattttcg 


cacgctgacc 


agccatcttc 


gtgcacaaat 


20580 


ctggggcttc 


acgttgccgt 


acagccgcag 


gcgaaaagcg 


ggcgacccgc 


aggcgctgca 


20640 


cgccgcctgc 


gcgcaggagc 


tggcgcgcct 


gacgctgcgg 


ttggatcagg gactggatac 


20700 






ccggttacag gaagagtac^HRcttaaaag ccgccagcgg ctgtatacW|^ccgcgtga 20760 
gttcgccgcc ctcggcgagc cgttgctgga ggctattgaa cagatcttcc tcgattcaaa 20820 
attcgatgcc acgcaactga ataacacgct gcgcggggtg tttttcacca gcgccgcgca 20880 
ggcgcaggcc gacgccgtgg ccgaccagtt gagtatctgg cagcgctttg tccgggcgat 20940 
aaaaaccgcc cgtggcgaat cctccgcctc tctcccacac gctctgccgg acggcaaccg 21000 
cagctacttc ctgcatgacc tgctgacaca gtttattttt cgtgaagcgc acctggtgga 21060 
gccaaacctc cagtgggcct ggcgttaccg cctgctgcgc ctcggcgggc acctgctggt 21120 
actggtgctg gcattcctgt tgtggcaggg gatgcagacc agccagcaga ccaacggcga 21180
ctatctgaat gaaatcagcg cccgcgcgac ccggctggac ggtgatgtga aagcctacac 21240
cggtaaaccg gcgatggctc ccgtcccggc actgctggac agcgcaaggg aactgtccgc 21300
ctggccggaa ctggacccgg acgcgccgcc gctggcctgg cgctacggtc tgtacagcgt 21360 
accgccggta accgacagcg tggcgtcgct gtacaaccgt ctgctggatc aactgctgct 21420 
gccgccgctg gtgaaacgga tggagtatgt gctggcggac gccattgccc gtcaggatag 21480
taaagcggcc tacgatgccc tgcgcatcta tctgctactg aatctggata aagatcacga 21540
agataaatac aacgcggcgg agatccagtc gtgggtgatt aacgatctgg ggaacagcga 21600 
cagcgtggcc gggttcggcg ggcgcgccgc cgtgctgacg catatcgaag cgctgtttga 21660 
cggcagccgg gtggtgcatt caccgtatga gaaagatgag gcgctgatcc gccaggcgcg 21720
ggcattcctc gacggtcaca ccagtaccga gcgtatctac gcgcgggcgc tggcggcaat 21780
ggagagcgaa gcgccgcagg agttcacgct ggtacgcgcc gtcggcgcgg atgcgggaac 21840
ggtctttgtg cgtagcaacg gcgcgccgct ggatcggggc gtgccgggta tttttacccg 21900 
tgaaggatac cgggagctgt tcgacaaacg attaccggaa tttgtggcgg cggcgacggc 21960 
gaacgatggc tgggtgatgg gccgggagag tacgccaaaa aagctgactg acagcctgcg 22020
cagccagata ccggggcagg agcagtctgt cgcccgcgaa gtccgccgtt tgtacctgac 22080
ggaatatgcc cgccgctggc aggattttct ggacagtatc catagtatca acagtgccgg 22140 
ggaagagggc agttccggcc tggcctatga tttacaggtg ctgcgcaccc tggcgtcgcc 22200 
ggactcaccg ctgatgcggc tgggaaaagc ggtggtggag cagaccacgc tggtgccgcc 22260
gccggacccg caggccagac agaaacaact ggcgcagcgc gcatccggta acgcggggaa 22320 
agtggtacag acggcaaaac tgttccagga tattcacccg gaagaacggc tggaaaaaac 22380
gctggtggat gatcgtttcg ccgcgctgcg cgaggtcatt gccgggcgga cggacggcgg 22440 
acagagcggc ggtgggacga tgcagatcgc ttcgctgctg accatgctca acgagtatta 22500 
cacccaactg accattgctg atagcgcgct ggcggcgggg acgttgccag cgcgcattac 22560




tgccgcagac aagttgcaac tggaggcggc gaaactgccc gcgccgctga aaaacatcct 22620
gctg§atctg acgaagcagg gaacgcgcaa aatcaacgcc gggaccggcg acgtgctgaa 22680
cacccagatg gaggcgatga tgggcgacga ctgccgtgac gccatcgacg ggcgctatcc 22740
gttcgccgac agtccgcagg aggtcagcgc cgaggacttt aaccgcatct tcgccagcgg 22800
cggcgtactg gatgctttct ggagcaaaca actggctccg ctggcggaca ccgccagcga 22860
cccgtggcgc tacaaaccga ccgaaggcaa catgacattg caggggccgg atctgacgcc 22920
gtttcagcag gcgaagcaga tccgcagcgt attttttaac agcgagggcg ggaaaaaatt 22980
ttcctggtcg atgcagatta gcgtggtgga tatggacccg gccatcacgg aactggtgat 23040
tgatatcgac ggccaggtgc tgcgctacgc ccatggtccg gaccgtcccc tgaaagtaac 23100
gtggccggga ccgcgcaacg gctcgatggc ggaaatcacc gccagcccgc gtattcgcca 23160
ggacacctca acgctactga ccggcgggcc gtgggcgctg tttcatctgc tggacgccgg 23220
aatggtacag gaaacggcgg tgcgggggcg tcaactggtg gaatatgatt ttgacggtcg 23280
ccgggtggtg ctggaaatca ccgccgggcg ggattttaac ccggtcagcc gggagctgtt 23340
gcagaacttc agttgcccgg cgagggcgct gtaatgcgcc ggccagcagt ggtatcagcc 23400
ccggcgatgt tcggcaggct gccggaccag cgcgactatg tgcgctggcg agtgggcgct 23460
gaagagggac gaatatggca ggactggctg aaccgccaga cgtgggtggg cagcgggcgg 23520
catatcgtgt tgccgggagg ccagacgacc gatgaagagc gtgacggctg gatgcacctg 23580
tcgccgcgtg gcgaacatga tatgcccgcg ccggaacacc atccgctgcc gtggagcttt 23640
gtgatgtcgc cgggattttt gccgatgggt gggacgactg actggctgac cggagtgctg 23700
atggcgtcac gggatagcgt gggacgcccg tggccgctgg tgatttacca gcgctgcggt 23760
cgggagtggc tggacgagag tttgcaggag acgcagggct ggctgtactg gctggcgagg 23820
ctggcggcgc agcatatcac gccggacacg atgcggcgcg ggcggctgac cgagcaggtg 23880
gaccagctat gggcgatgtg gcagccgggg ccgtggtggg cgcagtggct gcgtgggtta 23940
cgacgtacat cgcaacgcag ccgggaactg accgggctgc cggatgaagc accggtagta 24000
gaactgccgg gcgtgcgtta tctgccgtgg ccgggctggc cgggtaagac gctggggcag 24060
gcaacaccgg ggcaaggctg gttctggcag cagaacagtg aagggcggta tgtggatgcg 24120
ctgcggttgg tggatgaaaa aaatagaggt cagatgctgt gaaaaagtac gctgtagagg 24180
ttctgtttat gtctgcctgt gcaggaatgt tcctgcccgt atttgcctgg ggcggaacag 24240
atgtaaacat tgataacccg ctggcggagt gtgtggatat ccatcccgtt catcgtcagg 24300
aaatggataa ccttaccatt ctgaaaacaa cggtcacact gaaaaaaagc accggtgaat 24360






gtggctgttt ctcaacattaWcaactata ccagcttact ggcgcaggat ^ttgagggtt 24420
acggacgcgg aagtgcataj: Jtcccttcagg aggggaatat ctcactggcg aagatgcagg 24480 
gacgttatcc tttcagcttt gtattatcgg tggataacca gtctgtacgg gatcagaagc 24540 
tggcgctaat gatacgttgt acaccaccgc tgtaatacac agaatagtca gggagaagat 24600 
gatggcagta agactgactt ttgacgggca aaagctgaca tggcctggta tcgggatatt 24660 
taaggcgacc acggggttac cggatttaca gtggccagat aaacagtgtg tgccggatgc 24720 
ggcgataccg gaagggaatt ataaattgtt tattcagttt cagggggagg caccgataag 24780 
aaatgctgcg gattgtgatc tgggaccatc atggggctgg agtaccattc cgcgaggcca 24840
ggctgccgga acatgtgaga tatactgggc gaactgggga tataatcgta tccggctgga 24900 
atcagcggat gagaagaccc gaaaagcctg tgggggcaag cggggtggtt tttatatcca 24960 
tgattccacc aaaggttaca gtcatggttg tattgaagtg gaaccggtgt ttttccgtat 25020
tctgaaacag gagacggaaa aagaaaatgg tgaaaagaca tttacggtta atgttaagta 25080 
tgtttctggt cagcaaacga atggtggaac aaaacaataa taccgttaaa accggatgaa 25140 
attatcgtgt ctggacgtgc tgtggttgac aatgattaca atggtggcca gataagtaat 25200 
gagcaccaat gataaagtat atgactggcg gtgtgctggt tattaatcaa atttctataa 25260 
aatgcaatgc gaatctggta agcgtaataa aaataaaatc ataatattgt tatgttcatt 25320 
tccttattta tgtaattcag tatttatgtt atgtgctaat ttttgtgttt ttattttcat 25380 
ggctcttgtc agcaatatac cctgttcttc tggtaaatat ttaaatataa caggctggtt 25440 
gcattataaa gtgcggggca ctgtttcctg acggtgagtc tatttatttt aatccggtat 25500
taaaggagtc actaccatga gttttgtatc cacaaataat aaatccggta tgggagggct 25560 
gacgacaacc acgccgccga taaccggaga aagtggcggt gtcaccgcag attcagtcgc 25620
cggaagcgtg gcagatgcgg cggaatccgc cgtggaacag gctgcgggat cgctatttgg 25680
cgcattgccg gagccatcag gactggtgaa agccgcggta gcagcggcgc aggctgccgc 25740
cgccgcaggt atggcgcagg atgcggtatc ggccatcgtc tctgctgttg caggcgggcc 25800
gggggcgcat aatgtgacgg tcagcggcag cgccgtaccg ccgggcgcat tactgttcgc 25860 
cagcctggac ggcggcgaaa cattaagtga actgttcagc tatgtggtac agctaaaaac 25920 
gcccgacacc ctgaatctgg gctatgtctc cccggcggcc aacctgccgc tcaaaccgat 25980 
ggtgggcaaa gatctgtgcg tcaacatcga actggatggt ggcggtaaac gacatatcag 26040
cgggctggtc acggcggcgc gggtggtggg ccatgaaggg cgttcggtta cctatgagct 26100 
gcgtatggag ccgtgggtaa aactgctgac ccataccagc gactacaaag cattccagaa 26160 
taaaaccgtg gtggatattc tggatgaggt tctggcggaa tatccctacc cggtggaaaa 26220



gcggctggtg gaaagctacc cggtacgcac ctggcaggtg cagtacggtg aaactgattt 26280 
tgattttctt cagcgacfga* tgcaggagtg gggcatctac tggtggtttg agcacagcga 26340 
ggacagccac acgctggtgc tggcggatgc catcagcgcc cacaaagcat gtccggactc 26400 
gccgctggtc gagtggcacc aggaagggct gaagctggac aaggagttta tccacactat 26460 
cacggcaaac gagagcctgc ggactggaca gtgggtgctg gatgatttcg attttacgaa 26520 
gccacgttca ttgctggcaa acaccgtggc aaacccgcgt gaaaccggtc atgccaccta 26580 
cgagcattat gagtggccgg gagactactt cgacaagagt gaaggcgaga tgctgacgcg 26640 
cattcgtatg gaagcgcagc gcagccccgg cagtcgggtg ctggggggag ggaatatccg 26700 
cacactcatg accggttata ccttcacgct ggaaaactat cccaccgccg aagtcaatca 26760 
ggaatatctg ctgatgcaga ccttgctgtt tgtgcaggac aacgcgcagc acagcgggca 26820 
ggaccagcac tttacctttt ccacccgttt tgaactgcac cccacccgcg aggtgttccg 26880 
cccgcagcgg acggtgagca aaccccacac caaagggccg cagagcgcca tcgtcaccgg 26940 
cccggcgggc caggaaatct ggacggatca gtacgggcgg gtaaaggtac agtttggttg 27000 
ggatcgctac ggcaaaatgg atgaaaacag cacctgctgg atacgcgtca gctacccgtg 27060 
ggcgggcaaa ggcttcggga tgatccagat cccgcgtatc ggccaggaag tgctggtgga 27120 
tttcaaaaac ggcgatccgg atctgccgat catcgtgggg cgtacctaca accaggacac 27180 
catgccgccg tggggactgc cgggaatggc gtcgcagagc gggatcttca gccactcgct 27240 
gtatggcggg ccaacgaacg gcaacatgct gcgttttgac gacaaaacgg gcgcggagga 27300 
agtgaagttc cacgcggaaa aagatctcaa caccacggtg aagaataatg aaacgcatac 27360 
ggttatggtg gatcgcacta aaaccattat taaaaatgaa accaacagta ttggtgagga 27420 
cagaaacacc acggtaacga agaatgacgg cctttccgta aaactggcgc agacgatcaa 27480 
tatcggcacc acttatcgtt tagatgttgg cgatcaattc acgcttcgct gcggcaatgc 27540 
ggcgcttgtt ttacataagg acggctccat tgagttttgt ggcaagcaac tgatgttaca 27600 
taccagcgat gtcatgcaac tgattggtaa aggtattgat atgaacccgg atggcggcac 27660 
agccgtaacc gccgatgata ttgcccccct tctcacctct gagtgatctg aattaaacct 27720 
ggagttctca tggatcgacc ataccgcata caggaagggt gttttgtcct gcctgaaaca 27780 
tttacggatc gcagcgtcaa tatttttatc ctggagggca atgaacgaac atcgcccagc 27840 
ctgaatattt cccgcgatac gctaaaacct gatgaagacc tgcccgccta tattgaccgc 27900 
cagattgcac tgatgaaaaa aaatctcggt cagcaccggg tattgtcgcg agcgcctgca 27960 
caggcaggaa cgggcaatga tgcccttatg ggggaacaaa ttgccgccac ccataaatcc 28020 



gggaaaacgg aagtgtacc^pfcgtcaggcc gggtttattg caacccctgg^aaggtactg 28080 
gtcttcaccc tgaccagtcc ccgtcctttt gatgataaag cagacctact ctggaacacc 28140 
tggctggcag gctttcagcc ggataaaaac gaataatcac acggaggtgt gaccatgtat 28200 
gaagcagccc gtgtggatga tcctatctac cacaccagcg cgctcgccgg gtttcttatc 28260 
ggcgctatca tcggcatcgc cattatcgcg cttgccgcct ttgccttctt tagctgcggt 28320 
tttcttgccg ggctgattct gggttttatg gccgatcaaa tagcctccgg ggtattgcaa 28380 
ctgggcgagg ccatcgggcg ctccatccac cacacggcag gaaaaatcct caccggttcg 28440 
gagaatgtca gcaccaacag tcgcccggcg gcgcgcgcgg tactgagtac ggtgaaatgc 28500 
gataaccata tcgcagaaaa acgcatcgcc caagggtcgg aaaatatcta catcaacagc 28560 
cagcccgccg cccgtaagga tgaccacacc gaatgcgacg cggtgattga agacggttcg 28620 
ccgaatgtgt ttctcggcgg cggcacacag acggtactgg aaatcagttc tgaaattccg 28680 
gactggctgc gcaaggtggt ggatgtattg tttgtcgtgg cgagtctgct cggcgggctg 28740 
gccggggcgt ggcggcaggc ggcaaagctg gggacgaaat ttggcactaa atgtgccgct 28800 
aagtttatcg gcggggagct tgtcgggatg gccgtgggtg aggctatcag cgggctgttc 28860
agcaatccgg tggatgtgac caccgggcag aaaatcctgc tgccggaaac ggacttcacc 28920 
ctgcccggtc gcctgccggt cacctgctcg cgtttttacg ccagccacct ggaaactgtg 28980 
ggactgttgg gacggggctg gcggctgaac tgggaaacca gcctgcgcga tgacgatgaa 29040 
cacatcacgc tgaccggcgt acaggggcgg gaactgcgtt acccgaaaac gatgctgacg 29100 
cccggccacc agatatttga cccggaagaa cagttatacc tcagccgcct gcatgacggg 29160
cgttacgtgc tgcattacac cgatcgcagc tattacgtat ttggtgattt tgacagtgac 29220 
ggcatggcat acctgctgtt tatggagacg ccgcaccgcc agcgcattgt cttcgggcac 29280 
gaaggaggca gactggtacg gatagcctcc agcagcgggc atcacctgtt actgcaccgc 29340
acacagaccc cggcagggga gcggctgtcg cgaattgaac tggtgcaggg cggcacccgt 29400
ggcaatctgg tggagtaccg gtatgacgat aacggtcaac tgaccggcgt ggtgaaccgg 29460 
gcgggaacgc aggtgcgtca gtttgcttat gaaaacgggc tgatgacggc gcacagcaat 29520 
gcgacggggt tcacctgccg ctaccgctgg caggaactcg acggcgcgcc gcgcgtgacg 29580 
gagcacgaca ccagtgacgg cgaacattac cgctttgact atgattttgc cgcaggcacc 29640 
accaccgtca ccggcaggca gggggagaca tggcagtggt ggtacgacag ggaaacgtat 29700
atcaccgcgc accggacgcc gggcggtgga atgtaccgct tcacgtacaa cgaagaccac 29760
ttccctgtca acattgagct gcccggcggt cgcacggtgg cgtatgaata tgacatccag 29820
aaccgggtgg tgaagacgac agatccggaa ggccgggtga cgcagacgca gtggaacggc 29880



gagttcgacg aaatcacgcg cacggcgctg gacgatgacg ctgtctggaa aacgcagtac 29940 
aacgcccacg gccagccagt gcaggagacg gacccggaag ggcgggtgac gcagtacgct 30000
tacgatgaac aggggcagat gtgcagccgg acggatgcgg cgggcggcac gcagggcggc 30060 
gtgcggcggg agacgcagca gcgggatgcg ctgggccgtc tgttacggac ggagaatgaa 30120
cacggccagc ggacgttcag ctacaaccgg ctggaccaga taacggcagt gacgctcacg 30180 
cccacggagg cggggcaaca gcagcaccgg atgcaggccg acacggtgcg ttttgagtat 30240
gaccgcagcg gctggctgac ggcggagcac gcggggaacg gtagcatatg ttatcagcgc 30300 
gacgcgctgg gcaacccgac ggacatcacg ctgccggacg ggcagcacct gacgcatctg 30360 
tattacggga gcgggcatct gttacagacg gcgctggacg gcctgacggt gagcgagtat 30420 
gagcgcgaca gcctgcaccg tcagataatg cgcacgcagg ggcagcttgc gacgtacagc 30480
ggctatgacg acgacgggct gctgagctgg cagcgcagtc tggcgtccgg cagtgcccct 30540 
gttcttcctg gccagcgccc ggcgcggcag ggctgcgtga cgtcgaggga ctattactgg 30600 
aacaaccacg gcgaggtggg cacgattgac gacggcctgc gtggcagcgt ggtgtacagc 30660 
tatgacagaa gcggttacct gaccgggcgc tcaggtcaga tgtatgacca tgaccgttat 30720 
tattacgata aggcgggcaa cctgctggat aacgaagggc agggagcggt gatgagcaac 30780 
cggctgccgg gctgtggtcg tgaccgttac ggctataacg agtggggcga gctgaccacg 30840 
cggcgcgacc agcaactgga gtggaacgcg caggggcagc tgacgcgggt catcagcggc 30900
aacacggaga cgcactacgg ctacgatgcg ctggggaggc gaacccgcaa ggcgacgtac 30960
gggcggcaca cgggccatac ggcgcggagc cggacggact ttgtgtggga ggggttcagg 31020 
ctgttgcagg agaacgtgca gcagcagggc tggcggacct atctgtacga tgcggaacag 31080 
ccgtacacgc cggtggcgag cgtgacgggg cggggagaaa gcaggcaggt gtggtattac 31140 
cacacggatg tgacgggcac gccgcaggag gtgacggcgg cggacggaac gctggtgtgg 31200 
gcggggtata tcagggggtt tggagagaat gcggcggaca tcagcaacag cggggcgtac 31260 
tttcaccagc cgctgcggct gccggggcag tattttgacg acgagacagg gctgcattac 31320 
aatctgttca gatattatgc accggagtgt ggacggtttg tcagtcagga tccgatcggg 31380 
ctgaggggcg ggttaaacct ttatcagtat gcgccaaatc ctctcaaata tatagaccca 31440 
cttggtttaa ccgcgactgt tgggcgatgg atggggcctg cggaatatca gcaaatgctt 31500 
gatactggga cagtagtaca aagttcaaca gggacaactc atgttgccta ccctgctgat 31560 
atagatgctt ttggtaagca agcaaaaaat ggtgctatgt atgttgaatt tgatgtgcct 31620 
gaaaaatcat tagtacctac aaatgaagga tgggcaaaaa tagtagggcc agattctatc 31680 






gaagggcgat tagctaaac^BEaaggtttg cctgttcctg aaatgccaac^gcagaaaac 31740 
ataactgtaa ggggcgagaa aattaatggg gaagttgaag caaaatgcta aataaattta 31800 
aattgtgggt gagcaaacat actgattata cggtaattca taatgaaaat gatttatctt 31860 
acagtattat tatagatttt gaagatgacc ggtatatatc aagatttact gtatgggatg 31920 
acctaagctg tatgtcagaa gtaatggatg tggatactgg tttatataaa ttaaacaaga 31980 
gaaacgaatt ttctacattt gatgaacttc tggatatatt tgatgatttt atgataagta 32040 
ttaaataata gttggccggg taagaagtta actcttcccg gctgttttat tatctaaccc 32100 
ccatcaatcc ggagacgcgc taccggtacg atgcgctggg caggcgggtg agcaaggcga 32160
cgtacgggcg gcacacgggc catacggcgc ggagccggac ggactttgtg tgggaggggt 32220 
tcaggctgtt gcaggagaac gtgcagcagc agggctggcg gacatatctg tacgatgcgg 32280
aacagccgta cacgccggtg gcgagcgtga cgggaaaggg agaaagcagg caggtgtggt 32340 
attaccacac ggacgtgacg ggcacgccgc aggaggtgac ggcggcggac ggaacgctgg 32400 
tgtgggcggg gtatatcagg gggtttggag aaaatgcggc ggacatcagc aacagcgggg 32460
cgtactttca ccagccgctg cggctgccgg ggcagtattt tgacgacgag acagggctgc 32520 
attacaatct gttcagatat tatgcaccgg agtgtggacg gttcgtcagt caggatccga 32580 
ttgggctggc ggggggggct gaatctttac cagtatgcgc ctaatccgat tagatggatc 32640 
gatcctttag gacttgctat cctggagcat caatctaatt ttgatgcggc aaggagaacc 32700 
ggatttgaaa atgcgggtat gacaaaccct gaggatgtca ctttctcgaa agtcgatccc 32760 
aaaactggta ctgttgttga gtttaaaggt ccaaatgggg ctaaagttgc ttatgatgca 32820
cctcatgcag atatggatgt gacagcaggg catgataaac cacatgttgg ttggcaatcc 32880 
gcaggaaaaa gaggttccgg aggagctaat agaggtaata ttacttatga tggcccacaa 32940 
catccgcatc gctctgactc taagggagat gataaatgtt aaattcaaat atgtctgaac 33000
ttagaatcga actggagaat gcgattaaaa atctcggtat tcatgattat cgtgtcgata 33060 
aacccgaaca aatcgtttct gagataaaag agatatatgt taatggtaat cctagaacct 33120 
ggtggttatc attaaaacat agacaatatg tcttttctta taccgataat tctggatata 33180 
aaaacatatc acaaatagta agtaaacaac tcaatgaaag caatgtaatc aacaaacata 33240 
tatttttgat tgctgatgaa gataatgagc aaatatatgt atataacgtt cctcttaact 33300 
ccctgcctga aattatagaa aattgcagat attttgaata ttatgttgca gatcatgaac 33360 
tatcttggct tatatgtgaa aatgatcatg gtgatttgat tgtatgctca accattaagt 33420 
aaagcgcgag tgctctttag cgatatagtt gcccatattt aggcgttact agccgaagat 33480 
ggcgcgattg tctggcaggg gaaacagcaa ttctgaggtc aggaagatag cataacccat 33540 



taaccgggat agatccgcta gacctgaatc cagttgatgc gacaggttat agggtttatg 33600
gttatttgct cctggagcaa ataaacctta ttacattggt attactaatg atatggtttg 33660
acgaagggcc gagcattaaa gcactggcag gttatcaaaa gaaaatggaa ggatgctgcc 33720
atttgatgaa aatgtaatct aatggaaagt cagaggttac gagaaatatt atatagagaa 33780
atataaaacc agaaccggaa ccataggtga aaaaattccc tcaacaaata gagagaataa 33840
atataattca tttgatcatg ggcgaacaga tcccagcgca caagcattta aagactctta 33900
aaatagtaag ggagttggtt ccggtggagg aaaatgcgga tgagtgatta agaattttgg 33960
ggctgtgata agaagtcgag aacaatgctg cgttttgtga agcccggaga catattttgt 34020
tttaaattag atgaagatag atattgtttt gggcgaatta taacactaat gactgtcgga 34080
catctttctg aattatttga tataattaaa aaaccccctg gaataacaga gttagaaatt 34140
agtaatgcaa ggcgaattat tgaaccaatt atagtggata catattcttt atttgataag 34200
aaattagaaa atggaagtga ctggagaatt attggtcatc aggttaatta caatccaaaa 34260
aatttagatg gtatctattt tgcacttgga ataggtgatt cctgtaaaaa gaaagactgt 34320
tacggaaatg attttctcat ttcagaaagt gagtggaaaa cacttcctaa attatctcct 34380
aaagggggtt ttgatatcaa aaaacggctt gaaattgcct gaaaatgaaa ataaaaagcc 34440
gggaaagatc ttttgtcttc ccggatttta ttatttaatc cccgttcacc acattattta 34500
cccccgcctt aatatgcttc atcgactttt tcacctgata aagctccttc cgtagatccc 34560
tcacttcgtc cgtctctgca atcaggatca aacacccctc ggagatcttc acggtgacgc 34620
cgtgcccggt ctcaaatccc gcttcttcct gccagtcacc cttaaggtgc tggctgggga 34680
tttgtgagta acaggcggtc atgcaggttt cgctgttgat atggcggacg ctgacgccgg 34740
aggcattcat atttgctgac taaataaatt cttatttatc cgccggatgc tggctgattg 34800
tggagctcag ggtgagtgag tatgggcgcg acatcctggc accgcgcctc cctctccccg 34860
gccagccccg gccggtgatg agcagccggc tgccggggcc gtattttgac gatgaaacgg 34920
gcctgcatta aaatctgttc agatattatg taccggagtg tggctggttc gtcagtcagg 34980
atccaatagg gctgaaaggg ggatggaacc gatatcattc tccgctgaat cctattacag 35040
atagtgatcc tcttggcctt attacttgtg gtgctgatag aggtgattct ggcaagttat 35100
taagatgaaa aatggttgaa aataattgcg 35160
caaaaagtaa cggatttgct tgctggaatt gtgttaacga atgcaaataa aatcaatgct 35220
gttgatgttt ttactacagg aataagtatg gttaacgata aggatacagc tatattaatt 35280
agtaatttaa tgttgagatt cggtaaggag cttgatgaat ctgttgctgt tgttcagtcc 35340



cgttgtgatg aggatgaat 




gtatat 



cgagaaacgg ttggttttat: '* datgggtgaa 




35400 



atgcttatta aaataatgaa tccattatat gaaaaacatc cagaaataaa accaaaagga 35460
ttgaaacaaa acatctggaa ccggatgaat aatgtgtaaa agccggaggg gttatctttt 35520
cccggctttt tattatcaat tactcattaa ctcctgttcc gttcttttgc gtttaatcac 35580
cggaatatct ccggtattgt tcagcgcccc ggaaatgttt ttaaccactg ttctgcactc 35640
cgtttattaa tgcgggttac gcccatccct tcaatacagc caaagagtcc gtgggtatgc 35700
tgcggcgtga tcacgatgca atccctcatt acccgcacct tgacgggcat cccgttaata 35760
aaccccgcct gcggcatcca ttcaccgtac aaacgaactg aaagtctctc aacgcgtgag 35820
tatgtaagta tcccgcataa tcgagccatt cacatttaga gatcatccga cataatcaat 35880
ctgccaacgc aggagatcgc tatgcgtaaa gcccgtatta ctgcgcacca gatcatcgct 35940
gtgattagat cagtcgaatc cggacggact gttaaagatg tctaccggga gcccggtatt 36000
tctgaagcca ccagggacaa ctggaggtct ggatacggcg gcagggatac gcgtggaatc 36060
acaaaaggct ccaccgtatt tactgtctgc tcaagctgaa ttttcgccgt aagggtaaac 36120
aacggctgcc ggtacgcaat ccctcgccac tggtcacgcc ggaagcgctg aaccagagct 36180
ggtctgtggg cgtcgttttc gcacgttcaa tgttgttgat gactgtaatc gtgaagcgtt 36240
gtcgattgaa atcgatctga atctgccagc tctgcgagtg gtccgtgtac tcgacaggat 36300
tacagcaacc gcggttatct ggccatgctg cgtatggata agggaccgga atttatctcg 36360
ctggcactgg ctgaatgggc aaagaaacat gcagtaaagc tggcgtttat ccagccgggt 36420
aagccgaaga aaaacgtttt catcacgcgc tttaaccgga cataccgtac agaaatactc 36480
aattcttatc tgttcagaac gctgaatgag gtgtgggaaa ttacggataa agggttatca 36540
gaatataact gcgaacgtcc acatgaatcg cggaacaata tgataccgaa ggaataccgc 36600
caataacgtt atctggccgg aatcttaaaa atgcatggaa ctaaaacggg tctatttaca 36660
ggggcacctg cgatgaattt cgctgcactg aaaagcgata ccggatgaga gctgcttcaa 36720
attaatgtgc catgttcacg gggaggttgt gcgacgtttg cataatccag caagaactga 36780
aaggaagggg agagcttttt catgcctgta taatcagtct ggcctgtgtc agtcagctct 36840
tagtgttgag actctcgttg gagcgttata attgcttttc tgtttcggaa aacaagattt 36900
tccattaaag atcttccctg cgaggaaaag ttaactaata atcttaccgt cgagttagga 36960
gatgtatgtt taaatataaa caatgttgca acgatgcctg ataattatcc tctcttcgaa 37020
gataagtttc ccacacccag tgtagtaggt gtcatggtaa tgttatcact tgaatgtaaa 37080
tggaaggtat aattgctttt tgactggcat tctattccac cctgacaaca cgatgttaac 37140
atcaacactg tttatattgg caataacgca atttttttca gattaagagg tgctctgata 37200
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tatagatttt tatgacatta cttatttgaa ttggtaacaa ataccaataa gtacaagctg 37260 

ttattaccag ccacggattt tttacatacg gtaagatttg gtatggcgtt atgtattctg 37320 

gatgtgctgg attattttaa tttggtttaa aaaaggtggt tattcaa atg aaa agc 37376
Met Lys Ser
1

ata aaa aaa ttg att atc gca agt gcg ttg agc atg atg gct gct agt 37424
Ile Lys Lys Leu Ile Ile Ala Ser Ala Leu Ser Met Met Ala Ala Ser
5 10 15

tgt tat gct ggc tca ttt ttg ccg aac tca gag caa caa aaa tca gtg 37472
Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gln Gln Lys Ser Val
20 25 30 35

gat att gtg ttt tcc tct ccc caa gat tta acc gta tcg ctt att cca 37520
Asp Ile Val Phe Ser Ser Pro Gln Asp Leu Thr Val Ser Leu Ile Pro
40 45 50

gtg tcg ggc tta aag gct ggg aaa aat gct cct agc gcg aaa att gcg 37568
Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala Lys Ile Ala
55 60 65

aag ctt gta gtt aat tct act act ctt aaa gaa ttc ggg gtc agg ggg 37616
Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly Val Arg Gly
70 75 80

att tct aac aac gtg gta gac agt act ggc act gca tgg cgt gta gct 37664
Ile Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp Arg Val Ala
85 90 95

ggt aaa aat act ggt aaa gag atc ggt gtg ggc tta tca agt gac agt 37712
Gly Lys Asn Thr Gly Lys Glu Ile Gly Val Gly Leu Ser Ser Asp Ser
100 105 110 115

ctt aga aga tct gat agc acg gaa aaa tgg aat ggg gtg aac tgg atg 37760
Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val Asn Trp Met
120 125 130

acc ttt aat agc aat gac aca ctt gat att gtc ctg aca gga ccg gcg 37808
Thr Phe Asn Ser Asn Asp Thr Leu Asp Ile Val Leu Thr Gly Pro Ala
135 140 145

cag aat gtc aca gct gac acg tac cca ata act tta gac gta gtg gga 37856
Gln Asn Val Thr Ala Asp Thr Tyr Pro Ile Thr Leu Asp Val Val Gly
150 155 160

tat caa cct taa tagtaaacaa ctattagtgt attgtgcctt gtttaaggcg 37908
Tyr Gln Pro
165

caatacacat caaatcatct atttttcttt tacaattttt gat atg aaa ata gtt 37963
Met Lys Ile Val
170



aat ttt gct gta atg gcg gta gct ttg ttc gcc act aat tct atg gtt 38011
Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr Asn Ser Met Val
175 180 185

tca gta tat gcc gtc aa^fcag caa cta aat tca gcc act aaaxta ttc 38059
Ser Val Tyr Ala Val Asn Gln Gln Leu Asn Ser Ala Thr Lys Leu Phe
190 195 200

agc gtg aag ctg ggg gct aca cga gtg att tat cac gct ggt acg gct 38107
Ser Val Lys Leu Gly Ala Thr Arg Val Ile Tyr His Ala Gly Thr Ala
205 210 215

gga gcg acg ctc tcg gtg agc aac ccg cag aat tac cct att ttg gtt 38155
Gly Ala Thr Leu Ser Val Ser Asn Pro Gln Asn Tyr Pro Ile Leu Val
220 225 230 235

cag tct tca gtc aaa gca gca gac aaa agt tcg cct gct ccc ttt ttg 38203
Gln Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro Ala Pro Phe Leu
240 245 250

gtg atg ccg cct cta ttt cgt tta gaa gca aac cag cag agt caa ctg 38251
Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gln Gln Ser Gln Leu
255 260 265

cgt att gtc cgt act ggt ggt gac atg cca acg gat cgt gag act tta 38299
Arg Ile Val Arg Thr Gly Gly Asp Met Pro Thr Asp Arg Glu Thr Leu
270 275 280

cag tgg gtc tgt ata aag gcg gta cca ccc gaa aat gaa ccg tcg gat 38347
Gln Trp Val Cys Ile Lys Ala Val Pro Pro Glu Asn Glu Pro Ser Asp
285 290 295

aca cag gct aag ggc gcg acc ctt gac ctc aat ttg tcc atc aac gcc 38395
Thr Gln Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu Ser Ile Asn Ala
300 305 310 315

tgt gat aag ctg att ttc cgc ccg gat gcc gtg aag ggg acg ccg gaa 38443
Cys Asp Lys Leu Ile Phe Arg Pro Asp Ala Val Lys Gly Thr Pro Glu
320 325 330

gat gtt gca gga aat tta aga tgg gtg gag acg ggc aac aaa ctt aag 38491
Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly Asn Lys Leu Lys
335 340 345

gtg gag aac ccc acc ccg ttt tac atg aat tta gcc tct gtc aca gta 38539
Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala Ser Val Thr Val
350 355 360

ggg gga aag ccc att aca ggg ctt gag tat gtc ccc ccc ttt gct gac 38587
Gly Gly Lys Pro Ile Thr Gly Leu Glu Tyr Val Pro Pro Phe Ala Asp
365 370 375

aaa aca cta aat atg cca ggt agt gcc cat ggt gat atc gag tgg aga 38635
Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp Ile Glu Trp Arg
380 385 390 395

gtt att aca gac ttt ggt ggt gaa agt cat ccg ttc cac tac gtt ctt 38683
Val Ile Thr Asp Phe Gly Gly Glu Ser His Pro Phe His Tyr Val Leu
400 405 410

aaa taa atccaggggc ttagcggcag aaa atg aag ttc aaa caa cct gcc ttg 38736
Lys Met Lys Phe Lys Gln Pro Ala Leu
415 420



cta ctg ttc atc gcg gga gtg gtt cat tgc gca aat gcg cac act tac 38784
Leu Leu Phe Ile Ala Gly Val Val His Cys Ala Asn Ala His Thr Tyr
425 430 435

aca ttc gat gca tca atg ttg ggc gat gca gcg aaa ggg gtt gat atg 38832
Thr Phe Asp Ala Ser Met Leu Gly Asp Ala Ala Lys Gly Val Asp Met
440 445 450

tcg ctc ttt aac cag ggg tta caa cag cca ggg act tat cgc gtg gac 38880
Ser Leu Phe Asn Gln Gly Leu Gln Gln Pro Gly Thr Tyr Arg Val Asp
455 460 465

gtg atg gtg aat ggt aaa cgt gtc gac acc cgt gat gtg gtg ttc aaa 38928
Val Met Val Asn Gly Lys Arg Val Asp Thr Arg Asp Val Val Phe Lys
470 475 480 485

ttg gaa aag gat ggg caa gga acg cct gtt ctg gct cct tgt ttg acg 38976
Leu Glu Lys Asp Gly Gln Gly Thr Pro Val Leu Ala Pro Cys Leu Thr
490 495 500

gtc agt cag ctt tca cgc tac ggc gta aaa acg gaa gat tac cct cag 39024
Val Ser Gln Leu Ser Arg Tyr Gly Val Lys Thr Glu Asp Tyr Pro Gln
505 510 515

ttg tgg aaa gca gca aag ccc cca gat gag tgt gcg gat ctg acc gcc 39072
Leu Trp Lys Ala Ala Lys Pro Pro Asp Glu Cys Ala Asp Leu Thr Ala
520 525 530

att cca cag gct aaa gcg gta ctg gat atc aat aat cag caa ctg caa 39120
Ile Pro Gln Ala Lys Ala Val Leu Asp Ile Asn Asn Gln Gln Leu Gln
535 540 545

ctg agt att ccg cag ttg gcg ttg cgt ccg gaa ttt aag ggg atc gct 39168
Leu Ser Ile Pro Gln Leu Ala Leu Arg Pro Glu Phe Lys Gly Ile Ala
550 555 560 565

cca gaa gat ctt tgg gat gat ggt att ccg gcg ttt ctg atg aac tac 39216
Pro Glu Asp Leu Trp Asp Asp Gly Ile Pro Ala Phe Leu Met Asn Tyr
570 575 580

agt gcg agg aca acg cag acg gat tac aaa atg gat atg gtg ggg cgt 39264
Ser Ala Arg Thr Thr Gln Thr Asp Tyr Lys Met Asp Met Val Gly Arg
585 590 595

gac aac tct tcc tgg gta caa ctg caa ccg gga atc aat ata ggt gcg 39312
Asp Asn Ser Ser Trp Val Gln Leu Gln Pro Gly Ile Asn Ile Gly Ala
600 605 610

tgg cgt gtc cgc aat gcg acc agc tgg cag cgg agt agt caa ctg tcg 39360
Trp Arg Val Arg Asn Ala Thr Ser Trp Gln Arg Ser Ser Gln Leu Ser
615 620 625

ggg aag tgg cag gca gca tat acc tat gct gag cgt gga ctg tac tca 39408
Gly Lys Trp Gln Ala Ala Tyr Thr Tyr Ala Glu Arg Gly Leu Tyr Ser
630 635 640 645

cta aaa agt cgt ctg act ctg ggg caa aag act tcg cag ggg gag ata 39456
Leu Lys Ser Arg Leu Thr Leu Gly Gln Lys Thr Ser Gln Gly Glu Ile
650 655 660

ttt gat agt gtg cca ttt acc ggt gtg atg ttg gca tcg gat gac aac 39504
Phe Asp Ser Val Pro Phe Thr Gly Val Met Leu Ala Ser Asp Asp Asn
665 670 675



atg gtg ccc tac agt gag cgg cag ttt gct ccg gta gtg cgt ggg att 39552
Met Val Pro Tyr Ser Glu Arg Gln Phe Ala Pro Val Val Arg Gly Ile
680 685 690

gcc cgc acg cag gct cgg gtg gag gtc aaa cag aat ggt tac acc att 39600
Ala Arg Thr Gln Ala Arg Val Glu Val Lys Gln Asn Gly Tyr Thr Ile
695 700 705

tac aac acc act gtg gcg ccc gga ccg ttt gca ctg cgg gat ctg tcg 39648
Tyr Asn Thr Thr Val Ala Pro Gly Pro Phe Ala Leu Arg Asp Leu Ser
710 715 720 725

gta aca gac agt agt ggt gat ctg cat gtc acc gtg tgg gag gcc gat 39696
Val Thr Asp Ser Ser Gly Asp Leu His Val Thr Val Trp Glu Ala Asp
730 735 740

ggc agt aca caa atg ttt gtg gtg ccg tat cag acc ccg gcg ata gca 39744
Gly Ser Thr Gln Met Phe Val Val Pro Tyr Gln Thr Pro Ala Ile Ala
745 750 755

ctg cac cag gga tat ttg aag tac agc ctg ttg gcg ggc cga tac cga 39792
Leu His Gln Gly Tyr Leu Lys Tyr Ser Leu Leu Ala Gly Arg Tyr Arg
760 765 770

tcg tca gac tct gca acg gat aag cgg cag atc gcg cag gct acg ttg 39840
Ser Ser Asp Ser Ala Thr Asp Lys Arg Gln Ile Ala Gln Ala Thr Leu
775 780 785

atg tat ggt ctg ccg tgg aat ctc act gca tac ggc ggt ata cag agt 39888
Met Tyr Gly Leu Pro Trp Asn Leu Thr Ala Tyr Gly Gly Ile Gln Ser
790 795 800 805

gca acg cat aat caa gct gca ttg ctt ggt ttg ggg gga tct ctc ggg 39936
Ala Thr His Asn Gln Ala Ala Leu Leu Gly Leu Gly Gly Ser Leu Gly
810 815 820

cgg tgg ggg agt tta tct gtc gat gga agc gac aca cac agt cag cgt 39984
Arg Trp Gly Ser Leu Ser Val Asp Gly Ser Asp Thr His Ser Gln Arg
825 830 835

cag ggg gag gcg gta cag caa gga gcc tcc tgg cga ctg cgt tac agc 40032
Gln Gly Glu Ala Val Gln Gln Gly Ala Ser Trp Arg Leu Arg Tyr Ser
840 845 850

aac cag ctg act gcg acg ggg aca aat ttt ttt ctg acg aga tgg cag 40080
Asn Gln Leu Thr Ala Thr Gly Thr Asn Phe Phe Leu Thr Arg Trp Gln
855 860 865

tat gcc tcg cag ggc tat aac acc cta tcc gat gtg ctc gac agt tat 40128
Tyr Ala Ser Gln Gly Tyr Asn Thr Leu Ser Asp Val Leu Asp Ser Tyr
870 875 880 885

cga cat aat ggc aac cgt cta tgg tcg tgg cgg gaa aat ttg cag ccg 40176
Arg His Asn Gly Asn Arg Leu Trp Ser Trp Arg Glu Asn Leu Gln Pro
890 895 900

agc tcg cgt act acc ctg atg ttg agt cag tca tgg ggg agg cat ttg 40224
Ser Ser Arg Thr Thr Leu Met Leu Ser Gln Ser Trp Gly Arg His Leu
905 910 915



ggc aat ctg agt tta acc ggt tcc cgt acc gac tgg cgt aat cgc ccc 40272
Gly Asn Leu Ser Leu Thr Gly Ser Arg Thr Asp Trp Arg Asn Arg Pro
920 925 930

ggt cat gat gac agc tac gga ctg agt tgg gga acc tct atc gga ggg 40320
Gly His Asp Asp Ser Tyr Gly Leu Ser Trp Gly Thr Ser Ile Gly Gly
935 940 945



ggc tcg ctg tca ttg aac tgg aat caa aac aga acg ctg tgg cgc aat 40368
Gly Ser Leu Ser Leu Asn Trp Asn Gln Asn Arg Thr Leu Trp Arg Asn
950 955 960 965

ggc gcg cac cgt aaa gag aac ata acc agc ctg tgg ttc agt atg cca 40416
Gly Ala His Arg Lys Glu Asn Ile Thr Ser Leu Trp Phe Ser Met Pro
970 975 980

tta agc cgc tgg acg ggg aat aat gta agt gct agt tgg cag atg act 40464
Leu Ser Arg Trp Thr Gly Asn Asn Val Ser Ala Ser Trp Gln Met Thr
985 990 995

tca cca tca cac ggt ggt cag acg caa caa gtg ggg gtc aac gga gag 40512
Ser Pro Ser His Gly Gly Gln Thr Gln Gln Val Gly Val Asn Gly Glu
1000 1005 1010

gca ttc agt cag caa ctg gat tgg gag gtg cgt cag agt tac cgt gcc 40560
Ala Phe Ser Gln Gln Leu Asp Trp Glu Val Arg Gln Ser Tyr Arg Ala
1015 1020 1025

gat gcc ccg cca ggt ggt ggt aat aac agc gca ttg cac ttg gca tgg 40608
Asp Ala Pro Pro Gly Gly Gly Asn Asn Ser Ala Leu His Leu Ala Trp
1030 1035 1040 1045

aat ggg gat tac ggc ctg tta ggt ggt gac tat agc tac agc cgg gcg 40656
Asn Gly Asp Tyr Gly Leu Leu Gly Gly Asp Tyr Ser Tyr Ser Arg Ala
1050 1055 1060

atg cgc cag atg gga gtc aat atc gcg gga ggt ata gtt atc cac cat 40704
Met Arg Gln Met Gly Val Asn Ile Ala Gly Gly Ile Val Ile His His
1065 1070 1075

cat ggt gtg acg ctg ggg caa cct ttg caa ggc tca gtg gcg ctg gtt 40752
His Gly Val Thr Leu Gly Gln Pro Leu Gln Gly Ser Val Ala Leu Val
1080 1085 1090

gaa gcg cca ggg gcc tcg ggg gtg cca gtt ggc ggc tgg cct ggc gtt 40800
Glu Ala Pro Gly Ala Ser Gly Val Pro Val Gly Gly Trp Pro Gly Val
1095 1100 1105

aag acg gat ttt cgt ggc gac acc aca gtg ggc aac ctg aac gtc tat 40848
Lys Thr Asp Phe Arg Gly Asp Thr Thr Val Gly Asn Leu Asn Val Tyr
1110 1115 1120 1125

cag gag aat aca gtc agc ctc gat ccg tcg cga cta ccg gat gac gca 40896
Gln Glu Asn Thr Val Ser Leu Asp Pro Ser Arg Leu Pro Asp Asp Ala
1130 1135 1140

gag gtc aca caa acc gat gtg cgc gtg gtg cca acc gaa ggg gcg gtg 40944
Glu Val Thr Gln Thr Asp Val Arg Val Val Pro Thr Glu Gly Ala Val
1145 1150 1155







gtg gaa gcg aag ttt cac act cgc atc ggg gcc agg gca ctg atg acg 40992
Val Glu Ala Lys Phe His Thr Arg Ile Gly Ala Arg Ala Leu Met Thr
1160 1165 1170

ctg aaa cgg gaa gat ggt agc gcc att cct ttc ggg gcg cag gtt aca 41040
Leu Lys Arg Glu Asp Gly Ser Ala Ile Pro Phe Gly Ala Gln Val Thr
1175 1180 1185

gtc aat ggg cag gat ggc agt gct gct ctg gtg gat act gat agc cag 41088
Val Asn Gly Gln Asp Gly Ser Ala Ala Leu Val Asp Thr Asp Ser Gln
1190 1195 1200 1205

gtt tat ctc act ggt ttg gcg gat aag ggc gaa ctg acg gtg aaa tgg 41136
Val Tyr Leu Thr Gly Leu Ala Asp Lys Gly Glu Leu Thr Val Lys Trp
1210 1215 1220

gga gca cag caa tgt cgg gtt aac tac cgc cta cct gcc cac aag gga 41184
Gly Ala Gln Gln Cys Arg Val Asn Tyr Arg Leu Pro Ala His Lys Gly
1225 1230 1235

atc gcg ggc ttg tat caa atg agc ggt ctc tgc aga tag ccgattctga 41233
Ile Ala Gly Leu Tyr Gln Met Ser Gly Leu Cys Arg
1240 1245

aggagagaat a atg tgg atg aaa ata cag cga gtg aaa acg gtt atc tat 41283
Met Trp Met Lys Ile Gln Arg Val Lys Thr Val Ile Tyr
1250 1255 1260

agc gta agc tta ctg gtc gct gcc agt agc ttg gtg ccg ata gcg aac 41331
Ser Val Ser Leu Leu Val Ala Ala Ser Ser Leu Val Pro Ile Ala Asn
1265 1270 1275

gcc gca gaa aaa ctt cag aca acg cta cgt gta ggt act tac ttt cgg 41379
Ala Ala Glu Lys Leu Gln Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg
1280 1285 1290 1295

gct ggg cac gtg cca gat ggg atg gtg ctt gcg caa ggc tgg gtg act 41427
Ala Gly His Val Pro Asp Gly Met Val Leu Ala Gln Gly Trp Val Thr
1300 1305 1310

tat cac ggc agt cac agc ggg ttt cgg gta tgg agc gat gag caa aag 41475
Tyr His Gly Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gln Lys
1315 1320 1325

gcg ggt aac acg cct acc gta ttg ctg ctg agc ggg caa cag gat cct 41523
Ala Gly Asn Thr Pro Thr Val Leu Leu Leu Ser Gly Gln Gln Asp Pro
1330 1335 1340

cgc cat cac att cag gtt cgc ctg gag ggc gag ggg tgg caa cca gat 41571
Arg His His Ile Gln Val Arg Leu Glu Gly Glu Gly Trp Gln Pro Asp
1345 1350 1355

acg gtg agt ggt cgt ggc gcc att tta aga acc gct gca gat aac gcc 41619
Thr Val Ser Gly Arg Gly Ala Ile Leu Arg Thr Ala Ala Asp Asn Ala
1360 1365 1370 1375

agt ttc agt gtg gtc gtt gat ggc aat cag gaa gtg cct gcg gac acc 41667
Ser Phe Ser Val Val Val Asp Gly Asn Gln Glu Val Pro Ala Asp Thr
1380 1385 1390

tgg acg ctg gat ttt aag gcc tgt gca ttg gcg cag gag gat acg tag 41715
Trp Thr Leu Asp Phe Lys Ala Cys Ala Leu Ala Gln Glu Asp Thr
1395 1400 1405




ccgtctgttc cactcatact tcctgtatca gtgaataagg cgaatactgt atatcaggtt 41775 



cgaggatgta ttgaaaattt actcaaggca aatatggtta gcaatcttca ttgcagaaaa 41835 



atcaattgct gattgatttt aaaatctgaa attcatcttt tttgtaggga gaggaatatt 41895 
atgtttcgta aaataattta acggacactg atgcctttat tatttctgtt ttgtggattt 41955 
gtttgattca ctcttttgtg agcggggctt tataagcccc tggtataagg tttgtttatc 42015 
tgtcaggtat tcatatgtga tatttaagat tttatgttta gggggctttt tcttgtcccc 42075 
tcaggcgtaa aaataattta ttttttacat aaaggaataa agcatatgtc ttatgcacga 42135 
catttaccgg tattaatgta tcaccatgtc agtgataaac ccggacagat aaccttatct 42195 
ccccgtacgt tccgggcgca gatgaaatgg ctggccgaat ctggctggaa aaccgttact 42255 
gcctgcagag gtggaagctt tttatcatgg tgcaagattg cctcgtaaaa gcgtcatgct 42315 
gacctttgat ggcggctggc tggataactg gttgcaggtt tttccggtgc tgcaggagtt 42375
taatctgcat gcgcatctct ttcttgtgac cagtttgatc agtgacggac cggtccgtat 42435 
tcctgcaggc gaaccggtgt actctcatga tgagtgtcaa atgctggtta aacaaggccg 42495 
ggctgatgag gtcatgctgc gctggtcaga ggtccgggag atgcacctca gtggccttgt 42555 
tgagtttcac tcgcacacgc acacccaccg acgctgggac cagaagcctg tgtcccgtaa 42615 
tccgtcggat ttgcttcgtg tcgatattct tcttagtcgt aagcggatga gggagatgct 42675 
gggttattgc agtcagcatc tgtgctggcc tgagggctgg tattgttctg actatattca 42735
tgtggctgaa gagttggggt tcacatacct gtataccaca gaaaggcgta tgaacaatcc 42795 
agtcatcggt tcacagcgta ttggtcgtat caacgcaaag gagcgaaaga atgtgggctg 42855 
gctgaaacgt cgtctgtttt atcacaccac gcccggattt tcttcgctgc tggcccggca 42915 
taagggggca cgtcggatag ctgactgagc cagagaccag gatgaaagtg ctgcataccg 42975 
aatcctcccc catcatcggc gggcaggggt tgtaggctat atcccaaatg atggtgctga 43035
ttttgagcag aaataagata acaccggagg ccgtgataga cccggttgcg cagaatctgg 43095 
aggcagcagg tcactggcgc agggcttcta cccgctggct tttggttatg ggagattttg 43155 
aatgtacaga ggcccagcgg gagtggttgt tgttgcgccg gaattattgc cttgcgcaga 43215 
tatcttcccc tggcgccagt aaagctggat atcagcgacg tgacgaaatg agaggagcct 43275
gcacataggt ttgtgttttg agtaagtgtc ttatttaaac cgtctgttct gtttcctccg 43335 
ctttcacaaa taatgtcgag ccgggtgggg gactcaagta agaataatct ggcgatgttt 43395 
tgcttgtttc cacgggatac tttgttaggt gaacgataca attaatgcgc tccagactgc 43455 






gtcacagaaa aggagaatgc^ffttgcagaa cgggcgcatg acaatgaacc actttattat 43515 
tggtatctgg ttcggcgaca accttgacga tcacgtctga gatagggtat ttacactgag 43575 
tggtaaacag gttcaattag taaccggaga tggatgcaaa atcatgatcg attcagatgc 43635
tattttgcag ccaatagatt ttttattaag atgatatcaa ggacattgag gcacacaatg 43695
acgtatcagt aagtcgttga tagctcattt gatataagaa tttcttttat caacggaaga 43755
taatgatgga actgatcaat aatcgtggta tgcgagactg gatgatattt attaaagtgg 43815
cggaagtagg gaatctttcc cgggctgcgc gggaattaga tattagcatt tctgctgtca 43875
gtaaatcgct tagtcgcctt gagaattcta ttgaggttac tttacttcgg cgcgattcac 43935
atcacttaga actgactgga gctggtcaga cagcctatgc aagcatgaaa aggataacat 43995
cttcctttca gtccttgctg gatgaattgc gaaatccgga taaaattatc agagggagta 44055 
taaaattttc ggctccggct attgtctgtg agtttcttgc caataagtgg atatgggaat 44115 
ttacagctag ctatccggat acaaaaatct acctggattc acgagagcgt agcgattttt 44175 
ttagtaaatc cctggagttt gatgagctgg tttttaaaag tggcataatc gaaagtgagg 44235 
atctcgtgta tcgaaagata agccctttaa agttggttct ttgtgcgagt ccgaaatata 44295 
tcagaaaata tggcaggatc tcacaccctg gcgatttgga aaatcacatt attgtgggtc 44355 
ttcacaacca tggtctttcc ggacctctta ctcttttccg tcaggatgaa tcatacacta 44415 
ttagtggcgc tgttaatgtt catttatctt ccaataatct tttgagtgtt cttaatttgg 44475 
ttttagaagg aaagggtatc aacctcatga ctccggcctg gcttgccacc aaatacttaa 44535 
aaaataatga acttgaaatt atacttcctg aatggagggt tccagatctc cccatttatc 44595 
ttgtatggcg tcatcgtcag tattattctc ctttatttca acgctttctg tcttttattg 44655 
aagataaatg gaataatcgc ccacaaattg attttctgaa tgatgattaa cccgtttgga 44715 
atggttttga tacgttcctg acttaaaccc acatgatgac tgaattgagg catcgagata 44775 
tgcgactggt cagccagtcg tcttttgacg atgcccaata caaaacatga tgccgactga 44835
cggaaatgat aatacgcgga aacaggacgg ggctgttttt gggcagccgg aagttaagcc 44895
cataccagaa acgttgcagt gtactgaaaa atggcgccag gttgcacctg ttcaaagatt 44955 
ttctgaaggc gcaggagtat tcattactga tatcctccat tgcgccttcg ggaacccaca 45015 
ggaccagcta ttttaccgat agtgtttaaa aggcgtaagt aatgccgagc atgaagtcat 45075 
tggaggcagc ctttgtgtct gcatcataag cggtatgttc atcaccagca tagtgatttt 45135 
ttgaaatgct tactttgcca gcattaatgt atttataact ggcgtcaatc ataatattat 45195 
ctgttacagc atattttgca ccgatacctg cgccccaggc aaagttattt tttgaagcag 45255 
acagagtttc attaatacca aaaccaacag gaatggtgtt attacttagc ttcacatgag 45315 



cgaggccaac gcctgcgctg atatagggag taaatgccgt actattgtga aaatcataat 45375
agccattaac catgtaagtg gtcattcgga cctgattttt tacatttatg tgtactggat 45435 
caccaaatgc aataatatcc tgcccgcctt tagcatccgt ctcacctctg aaagtggtat 45495 
ccagttctaa acgtactgga agctggaatg gatcataaaa gtcataaccg atagcaaccc 45555 
cgccgccaaa aacgcctttg gtacggtcag gtaacgttgc atgaccatta actatctcat 45615 
cctggctgaa ggttgagttg attccataga cattgactac ggatgtcccc gctttcccgg 45675 
tgatatagat cccttctttt gctgatgcag tagcggacca ggctaccaca aggggaatga 45735 
tgcagactgc gaaaaagttt ttcatttcag aacctgcctt aatattgggc taaaagacaa 45795
gtttcacggt atagggtgtg atataacgat tacataaacg aagcccaaaa aacggtctat 45855
tgtaacgctg ggttttctgt aagcgggtaa aaaatgagat gaagatttta aataacaata 45915
cgataatcgt cggtatggaa atccatctcc tcgccaaatt gccccacgta cggtttcact 45975 
tctacgttat gtaacgggta gtgtgagatg gagcgatgct gtaagaaaaa gatgaagatg 46035
aatttgtacc cgacctggat aaagcccgtt atcccggaat aacgggcaaa aatatttact 46095 
caagtgcctg ggcgagatct tgttgtacct gttgacgctg ttctggtgtt aagactttgc 46155 
ttaaatcaaa ataatattta acccgataat agcgagcctg ttgttctatg ttactgaagg 46215 
ctgcaagctg ctgttttacg gcggcgtcat cccatttacc ggatttaatc acctctatca 46275 
gcgcaccgtc tttaattccc ttcatagaaa tctgactgac gtcggtttcc agttgttggt 46335 
gaagtttttt gatccgggta atctgatcgt ttgtcagctt cagatgctgg acaataggat 46395
cctgggcggg caggggagga ttggggacag cggtggcgaa agcgccaaaa gaaacgcccg 46455
ccagagtcgc tgccagtaaa gttgtgcgta caaagttttt catgaagata tcctgataag 46515 
ggagtgatta accgttttta ttacccacga atggcgagca attatcttag agcctatccc 46575 
agtagggcta ttttacttgc cattttggac ctgggcagtg ctcgccaaaa cgcgttagcg 46635 
ttttgaacgc cgctagcggc ggcccgaagg gcgagcgtag cgagtcaaac ctcacgtact 46695 
acgtgtacgc tccggttttt gcgcgctgtc cgtgtccaaa ctggctgcgc caataacgcc 46755 
tggtgggata ggctcttagt cagaatacgt tgcctgccac attacgccac gcgaatttgt 46815 
tttacggaga gttacggagt gaaacaatcc cgccgcggtg agcggcaggt tgctt 46870 

<210> 2 
<211> 166 
<212> PRT 

<213> Salmonella typhimurium 
<400> 2 

Met Lys Ser Ile Lys Lys Leu Ile Ile Ala Ser Ala Leu Ser Met Met
1 5 10 15

Ala Ala Ser Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gln Gln
20 25 30

Lys Ser Val Asp Ile Val Phe Ser Ser Pro Gln Asp Leu Thr Val Ser
35 40 45

Leu Ile Pro Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala
50 55 60

Lys Ile Ala Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly
65 70 75 80

Val Arg Gly Ile Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp
85 90 95

Arg Val Ala Gly Lys Asn Thr Gly Lys Glu Ile Gly Val Gly Leu Ser
100 105 110

Ser Asp Ser Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val
115 120 125

Asn Trp Met Thr Phe Asn Ser Asn Asp Thr Leu Asp Ile Val Leu Thr
130 135 140

Gly Pro Ala Gln Asn Val Thr Ala Asp Thr Tyr Pro Ile Thr Leu Asp
145 150 155 160

Val Val Gly Tyr Gln Pro
165



<210> 3 

<211> 245 

<212> PRT 

<213> Salmonella typhimurium 



<400> 3 

Met Lys Ile Val Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr
1 5 10 15

Asn Ser Met Val Ser Val Tyr Ala Val Asn Gln Gln Leu Asn Ser Ala
20 25 30

Thr Lys Leu Phe Ser Val Lys Leu Gly Ala Thr Arg Val Ile Tyr His
35 40 45

Ala Gly Thr Ala Gly Ala Thr Leu Ser Val Ser Asn Pro Gln Asn Tyr
50 55 60

Pro Ile Leu Val Gln Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro
65 70 75 80

Ala Pro Phe Leu Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gln
85 90 95

Gln Ser Gln Leu Arg Ile Val Arg Thr Gly Gly Asp Met Pro Thr Asp
100 105 110

Arg Glu Thr Leu Gln Trp Val Cys Ile Lys Ala Val Pro Pro Glu Asn
115 120 125

Glu Pro Ser Asp Thr Gln Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu
130 135 140

Ser Ile Asn Ala Cys Asp Lys Leu Ile Phe Arg Pro Asp Ala Val Lys
145 150 155 160

Gly Thr Pro Glu Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly
165 170 175

Asn Lys Leu Lys Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala
180 185 190

Ser Val Thr Val Gly Gly Lys Pro Ile Thr Gly Leu Glu Tyr Val Pro
195 200 205

Pro Phe Ala Asp Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp
210 215 220

Ile Glu Trp Arg Val Ile Thr Asp Phe Gly Gly Glu Ser His Pro Phe
225 230 235 240

His Tyr Val Leu Lys
245



<210> 4 
<211> 836 
<212> PRT 

<213> Salmonella typhimurium 
<400> 4 

Met Lys Phe Lys Gln Pro Ala Leu Leu Leu Phe Ile Ala Gly Val Val
1 5 10 15

His Cys Ala Asn Ala His Thr Tyr Thr Phe Asp Ala Ser Met Leu Gly
20 25 30

Asp Ala Ala Lys Gly Val Asp Met Ser Leu Phe Asn Gln Gly Leu Gln
35 40 45

Gln Pro Gly Thr Tyr Arg Val Asp Val Met Val Asn Gly Lys Arg Val
50 55 60

Asp Thr Arg Asp Val Val Phe Lys Leu Glu Lys Asp Gly Gln Gly Thr
65 70 75 80

Pro Val Leu Ala Pro Cys Leu Thr Val Ser Gln Leu Ser Arg Tyr Gly
85 90 95

Val Lys Thr Glu Asp Tyr Pro Gln Leu Trp Lys Ala Ala Lys Pro Pro
100 105 110

Asp Glu Cys Ala Asp Leu Thr Ala Ile Pro Gln Ala Lys Ala Val Leu
115 120 125

Asp Ile Asn Asn Gln Gln Leu Gln Leu Ser Ile Pro Gln Leu Ala Leu
130 135 140

Arg Pro Glu Phe Lys Gly Ile Ala Pro Glu Asp Leu Trp Asp Asp Gly
145 150 155 160

Ile Pro Ala Phe Leu Met Asn Tyr Ser Ala Arg Thr Thr Gln Thr Asp
165 170 175

Tyr Lys Met Asp Met Val Gly Arg Asp Asn Ser Ser Trp Val Gln Leu
180 185 190

Gln Pro Gly Ile Asn Ile Gly Ala Trp Arg Val Arg Asn Ala Thr Ser
195 200 205

Trp Gln Arg Ser Ser Gln Leu Ser Gly Lys Trp Gln Ala Ala Tyr Thr
210 215 220

Tyr Ala Glu Arg Gly Leu Tyr Ser Leu Lys Ser Arg Leu Thr Leu Gly
225 230 235 240

Gln Lys Thr Ser Gln Gly Glu Ile Phe Asp Ser Val Pro Phe Thr Gly
245 250 255

Val Met Leu Ala Ser Asp Asp Asn Met Val Pro Tyr Ser Glu Arg Gln
260 265 270

Phe Ala Pro Val Val Arg Gly Ile Ala Arg Thr Gln Ala Arg Val Glu
275 280 285

Val Lys Gln Asn Gly Tyr Thr Ile Tyr Asn Thr Thr Val Ala Pro Gly
290 295 300

Pro Phe Ala Leu Arg Asp Leu Ser Val Thr Asp Ser Ser Gly Asp Leu
305 310 315 320

His Val Thr Val Trp Glu Ala Asp Gly Ser Thr Gln Met Phe Val Val
325 330 335

Pro Tyr Gln Thr Pro Ala Ile Ala Leu His Gln Gly Tyr Leu Lys Tyr
340 345 350

Ser Leu Leu Ala Gly Arg Tyr Arg Ser Ser Asp Ser Ala Thr Asp Lys
355 360 365

Arg Gln Ile Ala Gln Ala Thr Leu Met Tyr Gly Leu Pro Trp Asn Leu
370 375 380

Thr Ala Tyr Gly Gly Ile Gln Ser Ala Thr His Asn Gln Ala Ala Leu
385 390 395 400

Leu Gly Leu Gly Gly Ser Leu Gly Arg Trp Gly Ser Leu Ser Val Asp
405 410 415

Gly Ser Asp Thr His Ser Gln Arg Gln Gly Glu Ala Val Gln Gln Gly
420 425 430

Ala Ser Trp Arg Leu Arg Tyr Ser Asn Gln Leu Thr Ala Thr Gly Thr
435 440 445

Asn Phe Phe Leu Thr Arg Trp Gln Tyr Ala Ser Gln Gly Tyr Asn Thr
450 455 460

Leu Ser Asp Val Leu Asp Ser Tyr Arg His Asn Gly Asn Arg Leu Trp
465 470 475 480

Ser Trp Arg Glu Asn Leu Gln Pro Ser Ser Arg Thr Thr Leu Met Leu
485 490 495

Ser Gln Ser Trp Gly Arg His Leu Gly Asn Leu Ser Leu Thr Gly Ser
500 505 510

Arg Thr Asp Trp Arg Asn Arg Pro Gly His Asp Asp Ser Tyr Gly Leu
515 520 525

Ser Trp Gly Thr Ser Ile Gly Gly Gly Ser Leu Ser Leu Asn Trp Asn
530 535 540

Gln Asn Arg Thr Leu Trp Arg Asn Gly Ala His Arg Lys Glu Asn Ile
545 550 555 560

Thr Ser Leu Trp Phe Ser Met Pro Leu Ser Arg Trp Thr Gly Asn Asn
565 570 575

Val Ser Ala Ser Trp Gln Met Thr Ser Pro Ser His Gly Gly Gln Thr
580 585 590

Gln Gln Val Gly Val Asn Gly Glu Ala Phe Ser Gln Gln Leu Asp Trp
595 600 605

Glu Val Arg Gln Ser Tyr Arg Ala Asp Ala Pro Pro Gly Gly Gly Asn
610 615 620

Asn Ser Ala Leu His Leu Ala Trp Asn Gly Asp Tyr Gly Leu Leu Gly
625 630 635 640

Gly Asp Tyr Ser Tyr Ser Arg Ala Met Arg Gln Met Gly Val Asn Ile
645 650 655

Ala Gly Gly Ile Val Ile His His His Gly Val Thr Leu Gly Gln Pro
660 665 670

Leu Gln Gly Ser Val Ala Leu Val Glu Ala Pro Gly Ala Ser Gly Val
675 680 685

Pro Val Gly Gly Trp Pro Gly Val Lys Thr Asp Phe Arg Gly Asp Thr
690 695 700

Thr Val Gly Asn Leu Asn Val Tyr Gln Glu Asn Thr Val Ser Leu Asp
705 710 715 720

Pro Ser Arg Leu Pro Asp Asp Ala Glu Val Thr Gln Thr Asp Val Arg
725 730 735

Val Val Pro Thr Glu Gly Ala Val Val Glu Ala Lys Phe His Thr Arg
740 745 750

Ile Gly Ala Arg Ala Leu Met Thr Leu Lys Arg Glu Asp Gly Ser Ala
755 760 765

Ile Pro Phe Gly Ala Gln Val Thr Val Asn Gly Gln Asp Gly Ser Ala
770 775 780

Ala Leu Val Asp Thr Asp Ser Gln Val Tyr Leu Thr Gly Leu Ala Asp
785 790 795 800

Lys Gly Glu Leu Thr Val Lys Trp Gly Ala Gln Gln Cys Arg Val Asn
805 810 815

Tyr Arg Leu Pro Ala His Lys Gly Ile Ala Gly Leu Tyr Gln Met Ser
820 825 830

Gly Leu Cys Arg
835



<210> 5 
<211> 156 
<212> PRT 

<213> Salmonella typhimurium 
<400> 5 

Met Trp Met Lys Ile Gln Arg Val Lys Thr Val Ile Tyr Ser Val Ser
1 5 10 15

Leu Leu Val Ala Ala Ser Ser Leu Val Pro Ile Ala Asn Ala Ala Glu
20 25 30

Lys Leu Gln Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg Ala Gly His
35 40 45

Val Pro Asp Gly Met Val Leu Ala Gln Gly Trp Val Thr Tyr His Gly
50 55 60

Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gln Lys Ala Gly Asn
65 70 75 80

Thr Pro Thr Val Leu Leu Leu Ser Gly Gln Gln Asp Pro Arg His His
85 90 95

Ile Gln Val Arg Leu Glu Gly Glu Gly Trp Gln Pro Asp Thr Val Ser
100 105 110

Gly Arg Gly Ala Ile Leu Arg Thr Ala Ala Asp Asn Ala Ser Phe Ser
115 120 125

Val Val Val Asp Gly Asn Gln Glu Val Pro Ala Asp Thr Trp Thr Leu
130 135 140

Asp Phe Lys Ala Cys Ala Leu Ala Gln Glu Asp Thr
145 150 155



SEQUENCE LISTING NO. 2

<110> Folkesson, Anders

<120> The complete sequence of the tcf insert of Salmonella
enterica serovar Typhi.

<130> The tcf insert in Salmonella typhi

<140>
<141>

<160> 6

<170> PatentIn Ver. 2.1

<210> 1 
<211> 9253 
<212> DNA 

<213> Salmonella typhi 

<220> 
<221> CDS 

<222> (1898) . . (2608) 

<223> tcfA putative fimbrial subunit

<220> 

<221> CDS 

<222> (2659) . . (3234) 

<223> tcfB putative fimbrial subunit 

<220> 

<221> CDS 

<222> (3360) . . (6029) 

<223> tcfC putative fimbrial subunit 

<220> 

<221> CDS 

<222> (6052) . . (7131) 

<223> tcfD putative fimbrial subunit 

<220> 

<221> CDS 

<222> (7264) . . (7719) 

<223> tinR putative transcriptional regulator 

<400> 1



tgtaagtatc ccgcataatc gagccattca catttagaga tcatccggca taatcaatct 60

gccaacgcag gagatcgctg tgcgtaaagc ccgtattact gcgcaccaga tcatcgctgt 120

gattagatca gttgaatccg gacggactgt taaagatgtc taccgggagg ccggtatttc 180

tgaagccacc agggacaact ggaagtctgg atacggcggc atggaagctt ctgatattaa 240

atcttgagga tgtcaacgcc aggatttatg gtcgtttttc gctattttta atatccgctg 300

tttgatcact tctgctgtcc gctttccgcc atttcatctt cactgattgg cgttgcgctt 360

ttggtcagcg ccccgacttt gcgtttttcc ttcagttggt aactttctcc tttgatattc 420



agtgtggttg agtgatgcag tagccgatcc aggatcgctg ttgccagcac gttatcaccg 480

aacatctctc cccagtctgc gaagcctttg tttgacgtca ggataatgct cgctttttca 540 

taccgtcggt tcagcagtcg gaagaacaga ctggcttcct cactggtcat tggcaggtaa 600 

cctatttcgt ccaggatcag cccccgcgca tagctcagtt gttgtagctg gcgctccagg 660 

cggttttcca gcttcgcttt catcagcgtc taccaacctg tcctgcggca tgaacaacac 720 

ccgatggccc acatctgccg ctttcacacc gggggcagcg gccaggtggg ttttctccac 780 

tccgggggcc ccagcaggat cacattctcg cagcgctcca cgaacgccag accggccaac 840

tcccggacga ccttacgatc gatgcctggc tggaagctga agtcgaactg ctccagcgtt 900 

ttgactcatg gcagacgagc ctgtttcagc cgggattcca ttccgcgctg atgtctgccg 960 

ttctattcct gctgcagcgc catgcacagg aatgtcgact tattttgggc tgacagaggc 1020

attggtcgtt aagacttgta tattcatgtt gcttactcgc acaaaagggg gaaaggaacc 1080 

gtaaacaaca ttgacaggta agggactggc atcaatttga atggtatttt ttataacatt 1140 

atgttttaaa aggtgaattg tattttcatg gtgagttgtt ctgttttata ttgttgtttg 1200 

tggaatattt agctaatatt taatctaaaa atagtgtatt atattgatta cactttgctg 1260 

gagaggtgta aacatcaagc gtaaccatat gatgtatata agtttttgtt tgctgatatc 1320 

atatttttaa cagatattga gcaagtaaat ctgatctaac tgtaacttta gttttctaaa 1380 

ttaattactt tgcttacttt tttttacctg gttttgtaaa acctcatgat gatcagtcta 1440 

tttgtggctt tgtttaaggt ttatttggtt ttgtagtctg aattgtatct ccctcctgga 1500 

gggctttctt tagattctct gtctcctacg ttgttatgtt acgtatttgt tgctttgaag 1560 

gagggggaaa tacagttcca tttatctgag taagtcaggt acacagtaac aactttctta 1620 

tgaagaattt ccaaaatttt tactgcggcg ttattaattg ttcagcgatt cttacagatc 1680 

tgtcgttcgc ttttggtgaa tgaaatccgt ggacttttat ttactaattt tttctttcct 1740 

gaaaaaaaca gaggtattga gcgaaaaatt ttattccgta tgatgccctc cacacaaaat 1800 

gtattaacac tgaatcgtaa tttgcttctt tatgctgata actttctgtc tatgctaata 1860 

ctaaaattta gatgactttt atacggtaaa atctggt atg aat ttt aaa gat act 1915 

Met Asn Phe Lys Asp Thr 
1 5 

ctt ccc ggg gtg ttt ctc tgt gtc gct atg ttt gca tgt ggt cat gcc 1963
Leu Pro Gly Val Phe Leu Cys Val Ala Met Phe Ala Cys Gly His Ala
10 15 20

agg gcg aat atg ctc gtt tat ccc atg gcg gca gaa att aat agt agc 2011
Arg Ala Asn Met Leu Val Tyr Pro Met Ala Ala Glu Ile Asn Ser Ser
25 30 35

cgc gaa gag gcc acc tcg ctg ttc gtc tat tct aaa tca gat cat gtg 2059
Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr Ser Lys Ser Asp His Val
40 45 50

caa tat att cga aca aga atc atg cgt att gaa cac ccc ggt atg cca 2107
Gln Tyr Ile Arg Thr Arg Ile Met Arg Ile Glu His Pro Gly Met Pro
55 60 65 70

cag gag aag gag gta cca gca ggg aat gat ata gag aca gga ctt gtt 2155
Gln Glu Lys Glu Val Pro Ala Gly Asn Asp Ile Glu Thr Gly Leu Val
75 80 85

gtc tcc ccg gag aaa ttt gct ctt tcc ccg gga aca aaa aaa aca ata 2203
Val Ser Pro Glu Lys Phe Ala Leu Ser Pro Gly Thr Lys Lys Thr Ile
90 95 100

cgt gtt atc agt act cag gca ccg gaa aga gag gaa gcc tgg cgg gta 2251
Arg Val Ile Ser Thr Gln Ala Pro Glu Arg Glu Glu Ala Trp Arg Val
105 110 115



tac ttc gag gct gtt cct gaa ctg gaa gat gat cca cag gca ggc gga 2299
Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp Asp Pro Gln Ala Gly Gly
120 125 130

aag caa aat tca tcc gta agt gtg aat ctt gtc tgg ggg gtg ttg ctg 2347
Lys Gln Asn Ser Ser Val Ser Val Asn Leu Val Trp Gly Val Leu Leu
135 140 145 150

cgt gtt tct ccg tca gac ccc agg cct gcg ctg gta acg gac ggt cac 2395
Arg Val Ser Pro Ser Asp Pro Arg Pro Ala Leu Val Thr Asp Gly His
155 160 165

cac ctg ctg aat acg gga aac aca cgg ctt tct ctt att cgg gct ggc 2443
His Leu Leu Asn Thr Gly Asn Thr Arg Leu Ser Leu Ile Arg Ala Gly
170 175 180

aac tgc gac acc aca tgc cac tgg cag aat ata ggc aaa agt att tat 2491
Asn Cys Asp Thr Thr Cys His Trp Gln Asn Ile Gly Lys Ser Ile Tyr
185 190 195

ccc ggc ggg agt gct gat att ccg gcc gga ata aaa agt aat gca ttt 2539
Pro Gly Gly Ser Ala Asp Ile Pro Ala Gly Ile Lys Ser Asn Ala Phe
200 205 210

cgt gtg gaa tat cgt acg ggt gca aat tca ccg gta atc tct gct gat 2587
Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser Pro Val Ile Ser Ala Asp
215 220 225 230

tta aca gca gcc gga aag taa aaacacacgg agcgtacgct ataccctaca 2638
Leu Thr Ala Ala Gly Lys
235

tttattctca gggggagcgg atg tat acc gag tgt aca tat atc act gta ata 2691
Met Tyr Thr Glu Cys Thr Tyr Ile Thr Val Ile
240 245

aac aac aaa gca agg tta ttt ttt atg aac atg aaa aca tct ttt att 2739
Asn Asn Lys Ala Arg Leu Phe Phe Met Asn Met Lys Thr Ser Phe Ile
250 255 260

gcc gca gct gtg gca ttg gcc acc gtt tat tct ttt tct gtt tct gcg 2787
Ala Ala Ala Val Ala Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala
265 270 275 280



gtt cag aag gat att acc gtc act gcc aat att gac agt aca ctt gaa 2835
Val Gln Lys Asp Ile Thr Val Thr Ala Asn Ile Asp Ser Thr Leu Glu
285 290 295

ctg ctg cag gcc gat ggt tca tcc ctc ccg tcg act atg aag ctg gat 2883
Leu Leu Gln Ala Asp Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp
300 305 310

ttc atg ccg ggt aag ggc ctg gtc cat aaa tca ctc cag acc cgc ctt 2931
Phe Met Pro Gly Lys Gly Leu Val His Lys Ser Leu Gln Thr Arg Leu
315 320 325

tac agc aac gat cag acc aag tcg gtt aat gta aaa ctg ttg aat gct 2979
Tyr Ser Asn Asp Gln Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala
330 335 340

cca caa ctt atc aac gtc ctg gat ccc acc aaa acc att gat atg gaa 3027
Pro Gln Leu Ile Asn Val Leu Asp Pro Thr Lys Thr Ile Asp Met Glu
345 350 355 360

gtg act ctg gga gga cgg tca ctg acc acc acc aat tct gta ctg gaa 3075
Val Thr Leu Gly Gly Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu
365 370 375

gct aaa acc ctg ttc ccg gac gga aaa act ggc gat gct tca gct ctg 3123
Ala Lys Thr Leu Phe Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu
380 385 390

ctg aac ctg gat att ggt cag aag gct gga gca gcc tta caa aac ctg 3171
Leu Asn Leu Asp Ile Gly Gln Lys Ala Gly Ala Ala Leu Gln Asn Leu
395 400 405

cct gcc ggt gaa tac agc gga ttg gtc agt ctg gtg att tca cag gct 3219
Pro Ala Gly Glu Tyr Ser Gly Leu Val Ser Leu Val Ile Ser Gln Ala
410 415 420

gtc act gcc ggc taa taactggtta ttagctcttc atctgatccg gttttggggg 3274
Val Thr Ala Gly
425

gcaccgttcg tacctgaacc ggatccggta ttgatcttat tattcattgc aattcaggtc 3334 

tctttacgtg agtcgttatt tctgg atg tat tat tta ctg gga ttg tgc agt 3386

Met Tyr Tyr Leu Leu Gly Leu Cys Ser 
430 435 



ttt acc agc cag gca act ctt att ccc cct cct gga ttt gaa tct ctg 3434
Phe Thr Ser Gln Ala Thr Leu Ile Pro Pro Pro Gly Phe Glu Ser Leu
440 445 450

ctg gaa gga cag act gag caa att gaa gtg ttg cta cca ggg cat tca 3482
Leu Glu Gly Gln Thr Glu Gln Ile Glu Val Leu Leu Pro Gly His Ser
455 460 465 470

ctg gga tta ttt ccg gtg gtg gtt aaa ccg gac acc gtg cag ttc atg 3530
Leu Gly Leu Phe Pro Val Val Val Lys Pro Asp Thr Val Gln Phe Met
475 480 485

tcc cca ttg atg gta ctt gaa agc agt ggg ctt gcc gcg ttg ccg gcc 3578
Ser Pro Leu Met Val Leu Glu Ser Ser Gly Leu Ala Ala Leu Pro Ala
490 495 500

gca gaa cgg caa aaa gcg ctg gct gca ctc agc cgt ccg ttg cta cgt 3626
Ala Glu Arg Gln Lys Ala Leu Ala Ala Leu Ser Arg Pro Leu Leu Arg
505 510 515

aac agc aat ctg gtc tgt ggt gtc tca gaa gca aaa gac agc agc gag 3674
Asn Ser Asn Leu Val Cys Gly Val Ser Glu Ala Lys Asp Ser Ser Glu
520 525 530

tgt ggt tac gtg gca aca gat aaa gag gat gtt gcg gtt att ttt gat 3722
Cys Gly Tyr Val Ala Thr Asp Lys Glu Asp Val Ala Val Ile Phe Asp
535 540 545 550

gag aac aac gct cag tta tct ttg ttt ctt aac cgg gac tgg ttg ccg 3770
Glu Asn Asn Ala Gln Leu Ser Leu Phe Leu Asn Arg Asp Trp Leu Pro
555 560 565

gat gaa gaa cga cgt gat aaa cgc tgg ctg act ccg acc ccg gag ggt 3818
Asp Glu Glu Arg Arg Asp Lys Arg Trp Leu Thr Pro Thr Pro Glu Gly
570 575 580

gtc agc gca ttt att cac cgc cag acg ctg tat ctg agt gat gat ctc 3866
Val Ser Ala Phe Ile His Arg Gln Thr Leu Tyr Leu Ser Asp Asp Leu
585 590 595

cac agt cgt aat atg aca ctg aat ggt agc ggt gcc ctg ggg ctt ggt 3914
His Ser Arg Asn Met Thr Leu Asn Gly Ser Gly Ala Leu Gly Leu Gly
600 605 610

gac ggt cgt tat ctg gga ggc gac tgg gcg gct atc tgg aat cag tca 3962
Asp Gly Arg Tyr Leu Gly Gly Asp Trp Ala Ala Ile Trp Asn Gln Ser
615 620 625 630

gaa cat tac aat aac agt cag gcc tgg ttt gac aat ctg ttt gtc cgt 4010
Glu His Tyr Asn Asn Ser Gln Ala Trp Phe Asp Asn Leu Phe Val Arg
635 640 645

cag gat ctc ggc aat cag tat tat ctc cag gct ggt cgg atg gat cag 4058
Gln Asp Leu Gly Asn Gln Tyr Tyr Leu Gln Ala Gly Arg Met Asp Gln
650 655 660

cgg aat ctg tcc agc gcc acg ggg ggg gat ttt ggg ttc agt ctg ctt 4106
Arg Asn Leu Ser Ser Ala Thr Gly Gly Asp Phe Gly Phe Ser Leu Leu
665 670 675

ccc ctg agc cgg ttt gat gga tta cga acc ggg acc acc caa gct tat 4154
Pro Leu Ser Arg Phe Asp Gly Leu Arg Thr Gly Thr Thr Gln Ala Tyr
680 685 690

gtt aac cat gag gtg gac cat aat gcc act ccg gtt atg gtt cag gtt 4202
Val Asn His Glu Val Asp His Asn Ala Thr Pro Val Met Val Gln Val
695 700 705 710
acc cga aat gcc cgt att gat att tat cgt ggc agc gag ttg ctg ggg 4250
Thr Arg Asn Ala Arg Ile Asp Ile Tyr Arg Gly Ser Glu Leu Leu Gly
715 720 725

agt cag ttc ctg acc ccg gga atg cat acc ctg gat act cat tct ctt 4298
Ser Gln Phe Leu Thr Pro Gly Met His Thr Leu Asp Thr His Ser Leu
730 735 740

cca ccg gga agc tat cct ctg gcg ttg cgg gtg tat gag gat ggg att 4346
Pro Pro Gly Ser Tyr Pro Leu Ala Leu Arg Val Tyr Glu Asp Gly Ile
745 750 755

ctg cgg cga acg gag acc cag ccc ttc agt aag ggg ggc aat agc ttc 4394
Leu Arg Arg Thr Glu Thr Gln Pro Phe Ser Lys Gly Gly Asn Ser Phe
760 765 770

agt gca cag acc cag tgg ttt att cag ggc ggg ctg gaa gat acc ggg 4442
Ser Ala Gln Thr Gln Trp Phe Ile Gln Gly Gly Leu Glu Asp Thr Gly
775 780 785 790

gat aaa gcc agc cat tat gac ggt gag act gtc atg gct gcc gga ttc 4490
Asp Lys Ala Ser His Tyr Asp Gly Glu Thr Val Met Ala Ala Gly Phe
795 800 805

caa act ggg ctg cgg aaa aat atc agt ctg acc gaa ggt atc tct ctg 4538
Gln Thr Gly Leu Arg Lys Asn Ile Ser Leu Thr Glu Gly Ile Ser Leu
810 815 820

gca cat gag gcc tgg tac agt gaa acc cga ctg aat tca cag cat gca 4586
Ala His Glu Ala Trp Tyr Ser Glu Thr Arg Leu Asn Ser Gln His Ala
825 830 835

gtg ctg gat ggc acg ctg gac ctt tct gcc ggg ata ctg cat ggg aca 4634
Val Leu Asp Gly Thr Leu Asp Leu Ser Ala Gly Ile Leu His Gly Thr
840 845 850

gac agc acg agc ggt aac act gag cag gtg aca tac aac gac gga ttt 4682
Asp Ser Thr Ser Gly Asn Thr Glu Gln Val Thr Tyr Asn Asp Gly Phe
855 860 865 870

tcc gcg agt ctg tgg cgt aac cat acg gaa agt gat gcc tgt agt ggt 4730
Ser Ala Ser Leu Trp Arg Asn His Thr Glu Ser Asp Ala Cys Ser Gly
875 880 885

cgt cat cca cag tca gtg cat gcc agt atg acc tgc cag act tcg atg 4778
Arg His Pro Gln Ser Val His Ala Ser Met Thr Cys Gln Thr Ser Met
890 895 900

aac gcc tcc ctg tcg gtt tcg gtg ggg aac tgg tat gcc cta ctg gga 4826
Asn Ala Ser Leu Ser Val Ser Val Gly Asn Trp Tyr Ala Leu Leu Gly
905 910 915

tac agt acc agc agg aca gaa ggt cgg ccg gtt tac cgg gga tat gat 4874
Tyr Ser Thr Ser Arg Thr Glu Gly Arg Pro Val Tyr Arg Gly Tyr Asp
920 925 930

gat aac agt gac aaa gaa aat gtg ttc tgg cga cag gca tac atc cct 4922
Asp Asn Ser Asp Lys Glu Asn Val Phe Trp Arg Gln Ala Tyr Ile Pro
935 940 945 950
gcc tct cac cgc gaa tct gct cag gct agt gca acg tac agc ctt aat 4970
Ala Ser His Arg Glu Ser Ala Gln Ala Ser Ala Thr Tyr Ser Leu Asn
955 960 965

atg gct ggc atg aat att aat acc cat ggg gga gta tgg cga acc cga 5018
Met Ala Gly Met Asn Ile Asn Thr His Gly Gly Val Trp Arg Thr Arg
970 975 980

aat gac gga gtg aat gat gat ggc ttg ttt atg agt gtc agt gtg tca 5066
Asn Asp Gly Val Asn Asp Asp Gly Leu Phe Met Ser Val Ser Val Ser
985 990 995

tat gcc tct caa cca ccg aca atg act ggc agt aat agg tat acc tca 5114
Tyr Ala Ser Gln Pro Pro Thr Met Thr Gly Ser Asn Arg Tyr Thr Ser
1000 1005 1010



gcc ggg acc gat att cac agt agc cgg aat caa aaa aca cag acg tcc 5162
Ala Gly Thr Asp Ile His Ser Ser Arg Asn Gln Lys Thr Gln Thr Ser
1015 1020 1025 1030

tgg aat gtg aac cat gtg aga tcc tgg cag cag gat ctg tat cgt gaa 5210
Trp Asn Val Asn His Val Arg Ser Trp Gln Gln Asp Leu Tyr Arg Glu
1035 1040 1045

ctg tcg gtg ggt ttc tcc ggt tat aac gac gac agc tgg agc ggg agt 5258
Leu Ser Val Gly Phe Ser Gly Tyr Asn Asp Asp Ser Trp Ser Gly Ser
1050 1055 1060

ctc ggc gga cgc atg agc ggc cgt atg ggt gaa ctg agc gcc act atc 5306
Leu Gly Gly Arg Met Ser Gly Arg Met Gly Glu Leu Ser Ala Thr Ile
1065 1070 1075

agt aac tcc cat caa cgt aat gcg ggc agc gcc agt tca ctc acc gct 5354
Ser Asn Ser His Gln Arg Asn Ala Gly Ser Ala Ser Ser Leu Thr Ala
1080 1085 1090

ggc tac agc tcg tct ctg gcg tta tcc cgt aat gga ctg ttc tgg gga 5402
Gly Tyr Ser Ser Ser Leu Ala Leu Ser Arg Asn Gly Leu Phe Trp Gly
1095 1100 1105 1110

ggt ggt cag gac ggt gaa ccg gcc tct ggc atg gcg gtg aac gtg gag 5450
Gly Gly Gln Asp Gly Glu Pro Ala Ser Gly Met Ala Val Asn Val Glu
1115 1120 1125

tca gag ggg gac gag ggc agt agc ggg aaa gta gtc agc gtt cgt ggc 5498
Ser Glu Gly Asp Glu Gly Ser Ser Gly Lys Val Val Ser Val Arg Gly
1130 1135 1140

agc agc cag ccg ttc agt ctc ggt ttt ggt cag cag tcg ctg ttg ctg 5546
Ser Ser Gln Pro Phe Ser Leu Gly Phe Gly Gln Gln Ser Leu Leu Leu
1145 1150 1155

atg gaa ggc tat aac gcc acg gag gtg acc att gag gat gca ggg gtt 5594
Met Glu Gly Tyr Asn Ala Thr Glu Val Thr Ile Glu Asp Ala Gly Val
1160 1165 1170

agt tca cag ggt atg gca ggc gta aaa gcg gga ggg gga agc agg tgt 5642
Ser Ser Gln Gly Met Ala Gly Val Lys Ala Gly Gly Gly Ser Arg Cys
1175 1180 1185 1190

tac ttc ctg aca ccc ggg cat ctg ctg gtt cac aac atc agc gcc agt 5690
Tyr Phe Leu Thr Pro Gly His Leu Leu Val His Asn Ile Ser Ala Ser
1195 1200 1205

atg agc cga ctg tac gtt ggc cgc gta ctg gac aag gat ggc aga ccg 5738
Met Ser Arg Leu Tyr Val Gly Arg Val Leu Asp Lys Asp Gly Arg Pro
1210 1215 1220

ctg ctg gac gca cag cca ctg aac tat cca ttt ttg tcg ttg gga cct 5786
Leu Leu Asp Ala Gln Pro Leu Asn Tyr Pro Phe Leu Ser Leu Gly Pro
1225 1230 1235

tcc ggg cga ttt agc ctg cag agc gag cat aaa gaa tcc agc ctg tgg 5834
Ser Gly Arg Phe Ser Leu Gln Ser Glu His Lys Glu Ser Ser Leu Trp
1240 1245 1250

ctg ctg tct aaa aac agg atc ctg cgt tgt ccg atg tca gta cat aaa 5882
Leu Leu Ser Lys Asn Arg Ile Leu Arg Cys Pro Met Ser Val His Lys
1255 1260 1265 1270

cgt cgg gat gtt atg cag gta gtg ggt gat gtg cgg tgt gaa tta agt 5930
Arg Arg Asp Val Met Gln Val Val Gly Asp Val Arg Cys Glu Leu Ser
1275 1280 1285

gac gtg gat gcc ctg cca cag gcg ttg caa ata tcg ccg cgg gtc atc 5978
Asp Val Asp Ala Leu Pro Gln Ala Leu Gln Ile Ser Pro Arg Val Ile
1290 1295 1300

cgt ttg ctg aac gtg gca ggt ttg ctg cgc cat tcc gtt cag gaa gcc 6026
Arg Leu Leu Asn Val Ala Gly Leu Leu Arg His Ser Val Gln Glu Ala
1305 1310 1315



tga cgtagagata aaggcgttaa ct atg agt aat aaa atg aag tgg acg agt 6078 

Met Ser Asn Lys Met Lys Trp Thr Ser 
1320 1325 

atg aca gcc cat tgg tca gca att att aat ttc atc cga aaa tat gtt 6126
Met Thr Ala His Trp Ser Ala Ile Ile Asn Phe Ile Arg Lys Tyr Val
1330 1335 1340

tat cca gca agg ata att gcc atc ctg ctg atg gct ggc gct aca ctg 6174
Tyr Pro Ala Arg Ile Ile Ala Ile Leu Leu Met Ala Gly Ala Thr Leu
1345 1350 1355 1360

cca caa gtc gcc gat gcg att acc gtc gac ctg aat tac gac aag aac 6222
Pro Gln Val Ala Asp Ala Ile Thr Val Asp Leu Asn Tyr Asp Lys Asn
1365 1370 1375

aat gta gcg gtc atc act cct gtc tgg tcc caa gaa tgg agt gta gca 6270
Asn Val Ala Val Ile Thr Pro Val Trp Ser Gln Glu Trp Ser Val Ala
1380 1385 1390

aat gtg ttg ggg gga tgg gta tgt cgt tca aac agg aat gaa aat gag 6318
Asn Val Leu Gly Gly Trp Val Cys Arg Ser Asn Arg Asn Glu Asn Glu
1395 1400 1405

ggg gcg tgt gaa gaa aca cat ttg gta tgg tgg tat gct ttt gga gct 6366
Gly Ala Cys Glu Glu Thr His Leu Val Trp Trp Tyr Ala Phe Gly Ala
1410 1415 1420

tat tca aaa att cgt ctg cgt ttc aga gaa caa atc agc cat gcc gaa 6414
Tyr Ser Lys Ile Arg Leu Arg Phe Arg Glu Gln Ile Ser His Ala Glu
1425 1430 1435 1440

att acg ctc ata ctg ctc ggc agt gtt cgt gat gcc tgt tat act ggt 6462
Ile Thr Leu Ile Leu Leu Gly Ser Val Arg Asp Ala Cys Tyr Thr Gly
1445 1450 1455

gtc atc aac atg aac gct gct gca tgt caa tgg ggt agg tcg ctg aaa 6510
Val Ile Asn Met Asn Ala Ala Ala Cys Gln Trp Gly Arg Ser Leu Lys
1460 1465 1470

ctt agg ata cct tca gaa gag ctt gcg aag ata cct aca agc gga aca 6558
Leu Arg Ile Pro Ser Glu Glu Leu Ala Lys Ile Pro Thr Ser Gly Thr
1475 1480 1485

tgg aaa gca acg tta gtc ctg gat tat tta caa tgg ggc gga gac gat 6606
Trp Lys Ala Thr Leu Val Leu Asp Tyr Leu Gln Trp Gly Gly Asp Asp
1490 1495 1500

cct tta ggc aca tca act aca gat atc acg ctg aat gta aca gac cac 6654
Pro Leu Gly Thr Ser Thr Thr Asp Ile Thr Leu Asn Val Thr Asp His
1505 1510 1515 1520

ttt gct gaa aat gcg gct att tac ttt ccg caa ttt ggt aca gca acg 6702
Phe Ala Glu Asn Ala Ala Ile Tyr Phe Pro Gln Phe Gly Thr Ala Thr
1525 1530 1535

ccc cgg gtg gac ctg aat ctt cac cgg atg aat gcc tca caa atg tcg 6750
Pro Arg Val Asp Leu Asn Leu His Arg Met Asn Ala Ser Gln Met Ser
1540 1545 1550

ggc agg gct aat ctg gat atg tgt ctg tat gac gga ggt gtg aaa gcc 6798
Gly Arg Ala Asn Leu Asp Met Cys Leu Tyr Asp Gly Gly Val Lys Ala
1555 1560 1565

cgt tca tta cag atg aag ata gaa gga agc aat aag tca ggt acg gga 6846
Arg Ser Leu Gln Met Lys Ile Glu Gly Ser Asn Lys Ser Gly Thr Gly
1570 1575 1580

ttt cag gtt ata aag agc gat tct gct gat acg att gat tat gcg gtc 6894
Phe Gln Val Ile Lys Ser Asp Ser Ala Asp Thr Ile Asp Tyr Ala Val
1585 1590 1595 1600

agt atg aat tat ggg gga cga agt att cct gtc acc cgt ggc gtg gag 6942
Ser Met Asn Tyr Gly Gly Arg Ser Ile Pro Val Thr Arg Gly Val Glu
1605 1610 1615

ttc agt ctg gat aac gtg gat aaa gca gca acg cgt ccg gtg gta ctt 6990
Phe Ser Leu Asp Asn Val Asp Lys Ala Ala Thr Arg Pro Val Val Leu
1620 1625 1630

ccc ggg caa cgg cag gcg gta cgt tgt gtg cca gtg ccc ctt acc ctg 7038
Pro Gly Gln Arg Gln Ala Val Arg Cys Val Pro Val Pro Leu Thr Leu
1635 1640 1645



aca aca caa ccc ttt aracr atc aga gag aag cgt tct ggt gag tat cag 7086
Thr Thr Gln Pro Phe Asn Ile Arg Glu Lys Arg Ser Gly Glu Tyr Gln
1650 1655 1660

gga acg ctg aca gtg aca atg ctg atg gga aca caa acc ccc tga 7131

Gly Thr Leu Thr Val Thr Met Leu Met Gly Thr Gln Thr Pro
1665 1670 1675

cagtaattat ttattttatt gatatctttc ttatatggtt ttttaaatca gagttctctt 7191 

tatatacttg ttttatttaa taaagagaat ctattcactt atgaaaatca atgcgtgagg 7251 

ttctgctttc ct atg act gtg tat tta gat gat aaa gat aaa gaa tta ttg 7302
Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu
1680 1685 1690

aaa gaa atc caa aaa gat tgt gca caa act tta tgg caa ctt gca tat 7350
Lys Glu Ile Gln Lys Asp Cys Ala Gln Thr Leu Trp Gln Leu Ala Tyr
1695 1700 1705

aaa gtg gga ctt acg ccc aca cca tgt ttc aaa cgt tta aaa aaa ctt 7398
Lys Val Gly Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu
1710 1715 1720

aaa gac agg ggg gtt atc att ggt cag ttc gct tta ttg gat aag gaa 7446
Lys Asp Arg Gly Val Ile Ile Gly Gln Phe Ala Leu Leu Asp Lys Glu
1725 1730 1735 1740

aaa cta ggt ctt tca ctt aat gtc ttt att atg att aac ata tct gag 7494
Lys Leu Gly Leu Ser Leu Asn Val Phe Ile Met Ile Asn Ile Ser Glu
1745 1750 1755

gag caa tac gct agt att tct gag aaa ata aag tca atg cct gag gtt 7542
Glu Gln Tyr Ala Ser Ile Ser Glu Lys Ile Lys Ser Met Pro Glu Val
1760 1765 1770

att gcc ttt tat cga att tct gga tca ttt aat tat tta atg cat aca 7590
Ile Ala Phe Tyr Arg Ile Ser Gly Ser Phe Asn Tyr Leu Met His Thr
1775 1780 1785

gta ttt aca gat atg aac gat tac tat agt ttt tat gag aaa ata ata 7638
Val Phe Thr Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys Ile Ile
1790 1795 1800

tta act aat tct tca att agt gga tct gca tcg agc ttt gtt ctt gag 7686
Leu Thr Asn Ser Ser Ile Ser Gly Ser Ala Ser Ser Phe Val Leu Glu
1805 1810 1815 1820

caa ata aag gaa aca aac gaa ctg tca gtg tga aagtgtgatg tgtacttact 7739
Gln Ile Lys Glu Thr Asn Glu Leu Ser Val
1825 1830

gatttaatac attattatcc ttcttacgga acaacaacgg cagattgcgg ctgttgaaca 7799

aggattttaa tcagcagtgg tgaaattaag cggcacagaa taacacagcg gaatatcaca 7859

tggttaaata tcaccccgtg catgtaacaa aaaaccgcat taaaacagat gatgttactg 7919



atatttattt cgttgaaccc ttctggaaaa aaggcgaaaa ccacataatt gagtcattga 7979 



tgttttttga agagttacaa aagtcattta atttattcaa ccataaatat gggttaaata 8039

aatatatact caggatcccc tgggaatttg tgctcataca tatggaaagg atcagtaaat 8099

taaatagcgt cgggttattt gctgtttctg ttgactttaa taacaaccac aaatttctga 8159

gcgagtacat caggagtcgc agagattatg gtatggaagt ttggtttgat ttttgtggta 8219

aacattctta ttccagtgaa attaaaaacc ttggattctt ttttcaggct tgcgtagtgc 8279

ctcgtgatcc taattttatt agtagtgttt atcattatca taagttccaa aagattcttg 8339

tcggggatat aaatgatgta gaacagaggg ccgtgtacca gaacgaagtt gattacatgc 8399

atggaatgca atggccatcg tcatatgacg gttttttctt tcgggatcat aaaaaaaatg 8459

aaacttggtg tatataacag aaggagtgaa aatttgaatc aaaaatatct tatttatttt 8519

ttgtttaatc attgttttgt tttttattac gattaaatat aaagaacatc attgttcgtg 8579

cggtggggag gctggaagtt taggggatga ccgtttatca acaattttat tacagccacc 8639

atacgaatgg tttatatatg cactagatgt attattttag tttaatatat cgatggttgc 8699

tatttgcatt gatgatgttc cgttacatta aggaatatac atctgtatct cgttatacgc 8759

acactcacat tactaatcat tattaatatg agtgtggttc ttgttttacg catgcatggt 8819

tgcatgtgac gttaaattta aatgagctga ctgtatgaat tctaaatact ttagagaggt 8879

gttttttgtc tcggtagttg ttatattatt attttatttg gtgttatttg cagccagtgc 8939

tcatgctgaa ggcggtttca gatctggagg cattgggtta tttatgacgg gaacaagaga 8999

gatgctactg tagagataat aaattctgct aaagattccc caattcttgt gcattgacat 9059

cctccacgtc ctgaagggcg tgggttcctg ctccaacggg ctgcctgact gcacgctcct 9119

tccacaggca agcacggcgt gtcccgctct aaaatgttac gcgcgccgtt tacatcggcg 9179

ttcgcagtat atcttcatac cagacacttg taagtatctc gcataatcgt gccattcaca 9239

tttagagatc atac 9253



<210> 2 
<211> 236 
<212> PRT 

<213> Salmonella typhi 
<400> 2 

Met Asn Phe Lys Asp Thr Leu Pro Gly Val Phe Leu Cys Val Ala Met
1 5 10 15

Phe Ala Cys Gly His Ala Arg Ala Asn Met Leu Val Tyr Pro Met Ala
20 25 30

Ala Glu Ile Asn Ser Ser Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr
35 40 45

Ser Lys Ser Asp His Val Gln Tyr Ile Arg Thr Arg Ile Met Arg Ile
50 55 60

Glu His Pro Gly Met Pro Gln Glu Lys Glu Val Pro Ala Gly Asn Asp
65 70 75 80

Ile Glu Thr Gly Leu Val Val Ser Pro Glu Lys Phe Ala Leu Ser Pro
85 90 95

Gly Thr Lys Lys Thr Ile Arg Val Ile Ser Thr Gln Ala Pro Glu Arg
100 105 110

Glu Glu Ala Trp Arg Val Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp
115 120 125

Asp Pro Gln Ala Gly Gly Lys Gln Asn Ser Ser Val Ser Val Asn Leu
130 135 140

Val Trp Gly Val Leu Leu Arg Val Ser Pro Ser Asp Pro Arg Pro Ala
145 150 155 160

Leu Val Thr Asp Gly His His Leu Leu Asn Thr Gly Asn Thr Arg Leu
165 170 175

Ser Leu Ile Arg Ala Gly Asn Cys Asp Thr Thr Cys His Trp Gln Asn
180 185 190

Ile Gly Lys Ser Ile Tyr Pro Gly Gly Ser Ala Asp Ile Pro Ala Gly
195 200 205

Ile Lys Ser Asn Ala Phe Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser
210 215 220

Pro Val Ile Ser Ala Asp Leu Thr Ala Ala Gly Lys
225 230 235



<210> 3 
<211> 191 
<212> PRT 

<213> Salmonella typhi 
<400> 3 

Met Tyr Thr Glu Cys Thr Tyr Ile Thr Val Ile Asn Asn Lys Ala Arg
1 5 10 15

Leu Phe Phe Met Asn Met Lys Thr Ser Phe Ile Ala Ala Ala Val Ala
20 25 30

Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala Val Gln Lys Asp Ile
35 40 45

Thr Val Thr Ala Asn Ile Asp Ser Thr Leu Glu Leu Leu Gln Ala Asp
50 55 60

Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp Phe Met Pro Gly Lys
65 70 75 80

Gly Leu Val His Lys Ser Leu Gln Thr Arg Leu Tyr Ser Asn Asp Gln
85 90 95

Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala Pro Gln Leu Ile Asn
100 105 110

Val Leu Asp Pro Thr Lys Thr Ile Asp Met Glu Val Thr Leu Gly Gly
115 120 125

Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu Ala Lys Thr Leu Phe
130 135 140

Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu Leu Asn Leu Asp Ile
145 150 155 160

Gly Gln Lys Ala Gly Ala Ala Leu Gln Asn Leu Pro Ala Gly Glu Tyr
165 170 175

Ser Gly Leu Val Ser Leu Val Ile Ser Gln Ala Val Thr Ala Gly
180 185 190



<210> 4 
<211> 889 
<212> PRT 

<213> Salmonella typhi 



<400> 4 

Met Tyr Tyr Leu Leu Gly Leu Cys Ser Phe Thr Ser Gln Ala Thr Leu
1 5 10 15

Ile Pro Pro Pro Gly Phe Glu Ser Leu Leu Glu Gly Gln Thr Glu Gln
20 25 30

Ile Glu Val Leu Leu Pro Gly His Ser Leu Gly Leu Phe Pro Val Val
35 40 45

Val Lys Pro Asp Thr Val Gln Phe Met Ser Pro Leu Met Val Leu Glu
50 55 60

Ser Ser Gly Leu Ala Ala Leu Pro Ala Ala Glu Arg Gln Lys Ala Leu
65 70 75 80

Ala Ala Leu Ser Arg Pro Leu Leu Arg Asn Ser Asn Leu Val Cys Gly
85 90 95

Val Ser Glu Ala Lys Asp Ser Ser Glu Cys Gly Tyr Val Ala Thr Asp
100 105 110

Lys Glu Asp Val Ala Val Ile Phe Asp Glu Asn Asn Ala Gln Leu Ser
115 120 125

Leu Phe Leu Asn Arg Asp Trp Leu Pro Asp Glu Glu Arg Arg Asp Lys
130 135 140

Arg Trp Leu Thr Pro Thr Pro Glu Gly Val Ser Ala Phe Ile His Arg
145 150 155 160

Gln Thr Leu Tyr Leu Ser Asp Asp Leu His Ser Arg Asn Met Thr Leu
165 170 175



Asn Gly Ser Gly Ala Leu Gly Leu Gly Asp Gly Arg Tyr Leu Gly Gly
180 185 190

Asp Trp Ala Ala Ile Trp Asn Gln Ser Glu His Tyr Asn Asn Ser Gln
195 200 205

Ala Trp Phe Asp Asn Leu Phe Val Arg Gln Asp Leu Gly Asn Gln Tyr
210 215 220

Tyr Leu Gln Ala Gly Arg Met Asp Gln Arg Asn Leu Ser Ser Ala Thr
225 230 235 240

Gly Gly Asp Phe Gly Phe Ser Leu Leu Pro Leu Ser Arg Phe Asp Gly
245 250 255

Leu Arg Thr Gly Thr Thr Gln Ala Tyr Val Asn His Glu Val Asp His
260 265 270

Asn Ala Thr Pro Val Met Val Gln Val Thr Arg Asn Ala Arg Ile Asp
275 280 285

Ile Tyr Arg Gly Ser Glu Leu Leu Gly Ser Gln Phe Leu Thr Pro Gly
290 295 300

Met His Thr Leu Asp Thr His Ser Leu Pro Pro Gly Ser Tyr Pro Leu
305 310 315 320

Ala Leu Arg Val Tyr Glu Asp Gly Ile Leu Arg Arg Thr Glu Thr Gln
325 330 335

Pro Phe Ser Lys Gly Gly Asn Ser Phe Ser Ala Gln Thr Gln Trp Phe
340 345 350

Ile Gln Gly Gly Leu Glu Asp Thr Gly Asp Lys Ala Ser His Tyr Asp
355 360 365

Gly Glu Thr Val Met Ala Ala Gly Phe Gln Thr Gly Leu Arg Lys Asn
370 375 380

Ile Ser Leu Thr Glu Gly Ile Ser Leu Ala His Glu Ala Trp Tyr Ser
385 390 395 400

Glu Thr Arg Leu Asn Ser Gln His Ala Val Leu Asp Gly Thr Leu Asp
405 410 415

Leu Ser Ala Gly Ile Leu His Gly Thr Asp Ser Thr Ser Gly Asn Thr
420 425 430

Glu Gln Val Thr Tyr Asn Asp Gly Phe Ser Ala Ser Leu Trp Arg Asn
435 440 445

His Thr Glu Ser Asp Ala Cys Ser Gly Arg His Pro Gln Ser Val His
450 455 460

Ala Ser Met Thr Cys Gln Thr Ser Met Asn Ala Ser Leu Ser Val Ser
465 470 475 480



Val Gly Asn Trp Tyr Ala Leu Leu Gly Tyr Ser Thr Ser Arg Thr Glu
485 490 495

Gly Arg Pro Val Tyr Arg Gly Tyr Asp Asp Asn Ser Asp Lys Glu Asn
500 505 510

Val Phe Trp Arg Gln Ala Tyr Ile Pro Ala Ser His Arg Glu Ser Ala
515 520 525

Gln Ala Ser Ala Thr Tyr Ser Leu Asn Met Ala Gly Met Asn Ile Asn
530 535 540

Thr His Gly Gly Val Trp Arg Thr Arg Asn Asp Gly Val Asn Asp Asp
545 550 555 560

Gly Leu Phe Met Ser Val Ser Val Ser Tyr Ala Ser Gln Pro Pro Thr
565 570 575

Met Thr Gly Ser Asn Arg Tyr Thr Ser Ala Gly Thr Asp Ile His Ser
580 585 590

Ser Arg Asn Gln Lys Thr Gln Thr Ser Trp Asn Val Asn His Val Arg
595 600 605

Ser Trp Gln Gln Asp Leu Tyr Arg Glu Leu Ser Val Gly Phe Ser Gly
610 615 620

Tyr Asn Asp Asp Ser Trp Ser Gly Ser Leu Gly Gly Arg Met Ser Gly
625 630 635 640

Arg Met Gly Glu Leu Ser Ala Thr Ile Ser Asn Ser His Gln Arg Asn
645 650 655

Ala Gly Ser Ala Ser Ser Leu Thr Ala Gly Tyr Ser Ser Ser Leu Ala
660 665 670

Leu Ser Arg Asn Gly Leu Phe Trp Gly Gly Gly Gln Asp Gly Glu Pro
675 680 685

Ala Ser Gly Met Ala Val Asn Val Glu Ser Glu Gly Asp Glu Gly Ser
690 695 700

Ser Gly Lys Val Val Ser Val Arg Gly Ser Ser Gln Pro Phe Ser Leu
705 710 715 720

Gly Phe Gly Gln Gln Ser Leu Leu Leu Met Glu Gly Tyr Asn Ala Thr
725 730 735

Glu Val Thr Ile Glu Asp Ala Gly Val Ser Ser Gln Gly Met Ala Gly
740 745 750

Val Lys Ala Gly Gly Gly Ser Arg Cys Tyr Phe Leu Thr Pro Gly His
755 760 765

Leu Leu Val His Asn Ile Ser Ala Ser Met Ser Arg Leu Tyr Val Gly
770 775 780

Arg Val Leu Asp Lys Asp Gly Arg Pro Leu Leu Asp Ala Gln Pro Leu
785 790 795 800

Asn Tyr Pro Phe Leu Ser Leu Gly Pro Ser Gly Arg Phe Ser Leu Gln
805 810 815

Ser Glu His Lys Glu Ser Ser Leu Trp Leu Leu Ser Lys Asn Arg Ile
820 825 830

Leu Arg Cys Pro Met Ser Val His Lys Arg Arg Asp Val Met Gln Val
835 840 845

Val Gly Asp Val Arg Cys Glu Leu Ser Asp Val Asp Ala Leu Pro Gln
850 855 860

Ala Leu Gln Ile Ser Pro Arg Val Ile Arg Leu Leu Asn Val Ala Gly
865 870 875 880

Leu Leu Arg His Ser Val Gln Glu Ala
885



<210> 5 
<211> 359 
<212> PRT 

<213> Salmonella typhi 
<400> 5 

Met Ser Asn Lys Met Lys Trp Thr Ser Met Thr Ala His Trp Ser Ala
1 5 10 15

Ile Ile Asn Phe Ile Arg Lys Tyr Val Tyr Pro Ala Arg Ile Ile Ala
20 25 30

Ile Leu Leu Met Ala Gly Ala Thr Leu Pro Gln Val Ala Asp Ala Ile
35 40 45

Thr Val Asp Leu Asn Tyr Asp Lys Asn Asn Val Ala Val Ile Thr Pro
50 55 60

Val Trp Ser Gln Glu Trp Ser Val Ala Asn Val Leu Gly Gly Trp Val
65 70 75 80

Cys Arg Ser Asn Arg Asn Glu Asn Glu Gly Ala Cys Glu Glu Thr His
85 90 95

Leu Val Trp Trp Tyr Ala Phe Gly Ala Tyr Ser Lys Ile Arg Leu Arg
100 105 110

Phe Arg Glu Gln Ile Ser His Ala Glu Ile Thr Leu Ile Leu Leu Gly
115 120 125

Ser Val Arg Asp Ala Cys Tyr Thr Gly Val Ile Asn Met Asn Ala Ala
130 135 140

Ala Cys Gln Trp Gly Arg Ser Leu Lys Leu Arg Ile Pro Ser Glu Glu
145 150 155 160

Leu Ala Lys Ile Pro Thr Ser Gly Thr Trp Lys Ala Thr Leu Val Leu
165 170 175

Asp Tyr Leu Gln Trp Gly Gly Asp Asp Pro Leu Gly Thr Ser Thr Thr
180 185 190

Asp Ile Thr Leu Asn Val Thr Asp His Phe Ala Glu Asn Ala Ala Ile
195 200 205

Tyr Phe Pro Gln Phe Gly Thr Ala Thr Pro Arg Val Asp Leu Asn Leu
210 215 220

His Arg Met Asn Ala Ser Gln Met Ser Gly Arg Ala Asn Leu Asp Met
225 230 235 240

Cys Leu Tyr Asp Gly Gly Val Lys Ala Arg Ser Leu Gln Met Lys Ile
245 250 255

Glu Gly Ser Asn Lys Ser Gly Thr Gly Phe Gln Val Ile Lys Ser Asp
260 265 270

Ser Ala Asp Thr Ile Asp Tyr Ala Val Ser Met Asn Tyr Gly Gly Arg
275 280 285

Ser Ile Pro Val Thr Arg Gly Val Glu Phe Ser Leu Asp Asn Val Asp
290 295 300

Lys Ala Ala Thr Arg Pro Val Val Leu Pro Gly Gln Arg Gln Ala Val
305 310 315 320

Arg Cys Val Pro Val Pro Leu Thr Leu Thr Thr Gln Pro Phe Asn Ile
325 330 335

Arg Glu Lys Arg Ser Gly Glu Tyr Gln Gly Thr Leu Thr Val Thr Met
340 345 350

Leu Met Gly Thr Gln Thr Pro
355



<210> 6
<211> 151
<212> PRT

<213> Salmonella typhi
<400> 6

Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu Lys Glu Ile
1 5 10 15

Gln Lys Asp Cys Ala Gln Thr Leu Trp Gln Leu Ala Tyr Lys Val Gly
20 25 30

Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu Lys Asp Arg
35 40 45

Gly Val Ile Ile Gly Gln Phe Ala Leu Leu Asp Lys Glu Lys Leu Gly
50 55 60

Leu Ser Leu Asn Val Phe Ile Met Ile Asn Ile Ser Glu Glu Gln Tyr
65 70 75 80

Ala Ser Ile Ser Glu Lys Ile Lys Ser Met Pro Glu Val Ile Ala Phe
85 90 95

Tyr Arg Ile Ser Gly Ser Phe Asn Tyr Leu Met His Thr Val Phe Thr
100 105 110

Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys Ile Ile Leu Thr Asn
115 120 125

Ser Ser Ile Ser Gly Ser Ala Ser Ser Phe Val Leu Glu Gln Ile Lys
130 135 140

Glu Thr Asn Glu Leu Ser Val
145 150